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COMPOSITIONS FOR USE IN IDENTIFICATION OF BACTERIA 
CROSS-REFERENCE TO RELATED APPLICATIONS 

[0001] The present application cLaims the benefit of priority to: U.S. Provisional Application 
Serial No. 60/545,425 filed February 18, 2004, U.S. Provisional Application Serial No. 
60/559,754, filed April 5, 2004, U.S. Provisional Application Serial No. 60/632,862, filed 
December 3, 2004, U.S. Provisional Application Serial No. 60/639,068, filed December 22, 
2004, and U.S. Provisional Application Serial No. 60/648,188, filed January 28, 2005, each of 
which is incorporated herein by reference in its entirety. 

STATEMENT OF GOVERNMENT SUPPORT 

[0002] This invention was made -with United States Government support under DARPA/SPO 
contract BAA00-09. The United States Government may have certain rights in the invention. 

FIELD OF THE INVENTION 

[0003] The present invention relates generally to the field of genetic identification of bacteria 
and provides nucleic acid compositions and kits useful for this purpose when combined with 
molecular mass analysis. 

BACKGROUND OF THE INVENTION 

[0004] A problem in determining the cause of a natural infectious outbreak or a bioterrorist 
attack is the sheer variety of organisms that can cause human disease. There are over 1400 
organisms infectious to humans; many of these have the potential to emerge suddenly in a 
natural epidemic or to be used in a malicious attack by bioterrorists (Taylor et al. Philos. Trans. 
R. Soc. London B. Biol. Sci., 20O1, 356, 983-989). This number does not include numerous 
strain variants, bioengineered versions, or pathogens that infect plants or animals. 

[0005] Much of the new technology being developed for detection of biological weapons 
incorporates a polymerase chain reaction (PCR) step based upon the use of highly specific 
primers and probes designed to selectively detect certain pathogenic organisms. Although this 
approach is appropriate for the most obvious bioterrorist organisms, like smallpox and anthrax, 
experience has shown that it is very difficult to predict which of hundreds of possible pathogenic 
organisms might be employed in a terrorist attack. Likewise, naturally emerging human disease 
that has caused devastating consequence in public health has come from unexpected families of 
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bacteria, viruses, fungi, or protozoa. Plants and animals also have their natural burden of 
infectious disease agents and there are equally important biosafety and security concerns for 
agriculture. 

[0006] A major conundrum in public health protection, biodefense, and agricultural safety and 
security is that these disciplines need to be able to rapidly identify and characterize infectious 
agents, while there is no existing technology with the breadth of function to meet this need. 
Currently used methods for identification of bacteria rely upon culturing the bacterium to effect 
isolation from other organisms and to obtain sufficient quantities of nucleic acid followed by 
sequencing of the nucleic acid, both processes which are time and labor intensive. 

[0007] Mass spectrometry provides detailed information about the molecules being analyzed, 
including high mass accuracy. It is also a process that can be easily automated. DNA chips with 
specific probes can only determine the presence or absence of specifically anticipated organisms. 
Because there are hundreds of thousands of species of benign bacteria, some very similar in 
sequence to threat organisms, even arrays with 10,000 probes lack the breadth needed to identify 
a particular organism. 

[0008] There is a need for a method for identification of bioagents which is both specific and 
rapid, and in which no culture or nucleic acid sequencing is required. Disclosed in U.S. Patent 
Application Serial Nos: 09/798,007, 09/891,793, 10/405,75"6, 10/418,514, 10/660,997, 
10/660,122, 10/660,996, 10/728,486, 10/754,415 and 10/829,826, each of which is commonly 
owned and incorporated herein by reference in its entirety, are methods for identification of 
bioagents (any organism, cell, or virus, living or dead, or a nucleic acid derived from such an 
organism, cell or virus) in an unbiased manner by molecular mass and base composition analysis 
of "bioagent identifying amplicons" which are obtained by amplification of segments of essential 
and conserved genes which are involved in, for example, translation, replication, recombination 
and repair, transcription, nucleotide metabolism, amino acid metabolism, lipid metabolism, 
energy generation, uptake, secretion and the like. Examples of these proteins include, but are not 
limited to, ribosomal RNAs, ribosomal proteins, DNA and RNA polymerases, elongation 
factors, tRNA synthetases, protein chain initiation factors, heat shock protein groEL, 
phosphoglycerate kinase, NADH dehydrogenase, DNA ligases, DNA gyrases and DNA 
topoisomerases, metabolic enzymes, and the like. 
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[0009] To obtain bioagent identifying amplicons, primers are selected to hybridize to conserved 
sequence regions which bracket variable sequence regions to yield a segment of nucleic acid 
which can be amplified and which is amenable to methods of molecular mass analysis. The 
variable sequence regions provide the variability of molecular mass which is used for bioagent 
identification. Upon amplification by PCR or other amplification methods with the specifically 
chosen primers, an amplification product that represents a bioagent identifying amplicon is 
obtained. The molecular mass of the amplification product, obtained by mass spectrometry for 
example, provides the means to uniquely identify the bioagent without a requirement for prior 
knowledge of the possible identity of the bioagent. The molecular mass of the amplification 
product or the corresponding base composition (which can be calculated from the molecular 
mass of the amplification product) is compared with a database of molecular masses or base 
compositions and a match indicates the identity of the bioagent. Furthermore, the method can be 
applied to rapid parallel analyses (for example, in a multi-well plate format) the results of which 
can be employed in a triangulation identification strategy which is amenable to rapid throughput 
and does not require nucleic acid sequencing of the amplified target sequence for bioagent 
identification. 

[0010] The result of determination of a previously unknown base composition of a previously 
unknown bioagent (for example, a newly evolved and heretofore unobserved bacterium or virus) 
has downstream utility by providing new bioagent indexing information with which to populate 
base composition databases. The process of subsequent bioagent identification analyses is thus 
greatly improved as more base composition data for bioagent identifying amplicons becomes 
available. 

[0011] The present invention provides oligonucleotide primers and compositions and kits 
containing the oligonucleotide primers, which define bacterial bioagent identifying amplicons 
and, upon amplification, produce corresponding amplification products whose molecular masses 
provide the means to identify bacteria, for example, at and below the species taxonomic level. 

SUMMARY OF THE INVENTION 

[0012] The present invention provides primers and compositions comprising pairs of primers, 
and kits containing the same for use in identification of bacteria. The primers are designed to 
produce bacterial bioagent identifying amplicons of DNA encoding genes essential to life such 
as, for example, 16S and 23 S rRNA, DNA-directed RNA polymerase subunits (rpoB and rpoC), 
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valyl-tRNA synthetase (valS), elongation factor EF-Tu (TufB), ribosomal protein L2 (rplB), 
protein chain initiation factor (infB), and spore protein (sspE). The invention further provides 
drill-down primers, compositions comprising pairs of primers and kits containing the same, 
which are designed to provide sub-species characterization of bacteria. 

[O013] In particular, the present invention provides an oligonucleotide primer 16 to 35 
nucleobases in length comprising 80% to 100% sequence identity with SEQ ID NO: 26, or a 
composition comprising the same; an oligonucleotide primer 20 to 27 nucleobases in length 
comprising at least a 20 nucleobase portion of SEQ ID NO: 388, or a composition comprising 
the same; a composition comprising both primers; and a composition comprising a first 
oligonucleotide primer 15 to 35 nucleobases in length comprising between 70% to 100% 
sequence identity of SEQ ID NO: 26, and a second oligonucleotide primer 16 to 35 nucleobases 
in length comprising between 70% to 100% sequence identity of SEQ ID NO: 388. 

[O014] The present invention also provides an oligonucleotide primer 22 to 35 nucleobases in 
length comprising SEQ ID NO: 29, or a composition comprising the same; an oligonucleotide 
primer 18 to 35 nucleobases in length comprising SEQ ID NO: 391, or a composition comprising 
the same; a composition comprising both primers; and a composition comprising a first 
oligonucleotide primer 16 to 35 nucleobases in length comprising between 70% to 100% 
sequence identity of SEQ ID NO: 29, and a second oligonucleotide primer 13 to 35 nucleobases 
in length comprising between 70% to 100% sequence identity of SEQ ID NO: 391 . 

[0015] The present invention also provides an oligonucleotide primer 22 to 26 nucleobases in 
length comprising SEQ ID NO: 37, or a composition comprising the same; an oligonucleotide 
primer 20 to 30 nucleobases in length comprising SEQ ID NO: 362, or a composition comprising 
the same; a composition comprising both primers; and a composition comprising a first 
oligonucleotide primer 16 to 35 nucleobases in length comprising between 70% to 100% 
sequence identity of SEQ ID NO: 37, and a second oligonucleotide primer 14 to 35 nucleobases 
in length comprising between 70% to 100% sequence identity of SEQ ID NO: 362. 

[O016] The present invention also provides an oligonucleotide primer 13 to 35 nucleobases in 
length comprising 70% to 100% sequence identity with SEQ ID NO: 48, or a composition 
comprising the same; an oligonucleotide primer 19 to 35 nucleobases in length comprising SEQ 
ID NO: 404, or a composition comprising the same; a composition comprising both primers; and 
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a composition comprising a first oligonucleotide primer 13 to 35 nucleobases in length 
comprising between 70% to 100% sequence identity of SEQ ID NO: 48, and a second 
oligonucleotide primer 14 to 35 nucleobases in length comprising between 70% to 100% 
sequence identity of SEQ ID NO: 404. 

[0017] The present invention also provides an oligonucleotide primer 21 to 35 nucleobases in 
length comprising 70% to 100% sequence identity with SEQ ID NO: 160, or a composition 
comprising the same; an oligonucleotide primer 21 to 35 nucleobases in length comprising at 
least a 16 nucleobase portion of SEQ ID NO: 515, or a composition comprising the same; a 
composition comprising both primers; and a composition comprising a first oligonucleotide 
primer 21 to 35 nucleobases in length comprising between 70% to 100% sequence identity of 
SEQ ID NO: 160, and a second oligonucleotide primer 21 to 35 nucleobases in length 
comprising between 70% to 100% sequence identity of SEQ ID NO: 515. 

[0018J The present invention also provides an oligonucleotide primer 17 to 35 nucleobases in 
length comprising 70% to 100% sequence identity with SEQ ID NO: 261, or a composition 
comprising the same; an oligonucleotide primer 18 to 35 nucleobases in length comprising at 
least a 16 nucleobase portion of SEQ ID NO: 624, or a composition comprising the same; a 
composition comprising both primers; and a composition comprising a first oligonucleotide 
primer 17 to 35 nucleobases in length comprising between 70% to 100% sequence identity of 
SEQ ID NO: 261, and a second oligonucleotide primer 18 to 35 nucleobases in length 
comprising between 70% to 100% sequence identity of SEQ ID NO: 624. 

[0019] The present invention also provides an oligonucleotide primer 21 to 35 nucleobases in 
length comprising 70% to 100% sequence identity with SEQ ID NO: 231, or a composition 
comprising the same; an oligonucleotide primer 17 to 35 nucleobases in length comprising 70% 
to 100% sequence identity with SEQ ID NO: 591; , or a composition comprising the same; a 
composition comprising both primers; and a composition comprising a first oligonucleotide 
primer 21 to 35 nucleobases in length comprising between 70% to 100% sequence identity of 
SEQ ID NO: 231, and a second oligonucleotide primer 17 to 35 nucleobases in length 
comprising between 70% to 100% sequence identity of SEQ ID NO: 591. 

[0020] The present invention also provides an oligonucleotide primer 14 to 35 nucleobases in 
length comprising 70% to 100% sequence identity with SEQ ID NO: 349, or a composition 
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comprising the same; an oligonucleotide primer 17 to 35 nucleobases in length comprising 70% 
to 100% sequence identity with SEQ ID NO: 71 1, or a composition comprising the same; a 
composition comprising both primers; and a composition comprising a first oligonucleotide 
primer 14 to 35 nucleobases in length comprising between 70% to 100% sequence identity of 
SEQ ID NO: 349, and a second oligonucleotide primer 17 to 35 nucleobases in length 
comprising between 70% to 100% sequence identity of SEQ ID NO: 711. 

[0021] The present invention also provides an oligonucleotide primer 16 to 35 nucleobases in 
length comprising 70% to 100% sequence identity with SEQ ID NO: 240, or a composition 
comprising the same; an oligonucleotide primer 15 to 35 nucleobases in length comprising 70% 
to 100% sequence identity with SEQ ID NO: 596, or a composition comprising the same; a 
composition comprising both primers; and a composition comprising a first oligonucleotide 
primer 16 to 35 nucleobases in length comprising between 70% to 100% sequence identity of 
SEQ ID NO: 240, and a second oligonucleotide primer 15 to 35 nucleobases in length 
comprising between 70% to 100% sequence identity of SEQ ID NO: 596. 

[0022] The present invention also provides an oligonucleotide primer 16 to 35 nucleobases in 
length comprising 70% to 100% sequence identity with SEQ ID NO: 58, or a composition 
comprising the same; an oligonucleotide primer 21 to 35 nucleobases in length comprising at 
least a 16 nucleobase portion of SEQ ID NO:414, or a composition comprising the same; a 
composition comprising both primers; and a composition comprising a first oligonucleotide 
primer 16 to 35 nucleobases in length comprising between 70% to 100% sequence identity of 
SEQ ID NO: 58, and a second oligonucleotide primer 15 to 35 nucleobases in length comprising 
between 70% to 100% sequence identity of SEQ ID NO: 414. 

[0023] The present invention also provides an oligonucleotide primer 16 to 35 nucleobases in 
length comprising at least a 16 nucleobase portion of SEQ ID NO: 6, or a composition 
comprising the same; an oligonucleotide primer 16 to 35 nucleobases in length comprising at 
least a 16 nucleobase portion of SEQ ID NO:369, or a composition comprising the same; a 
composition comprising both primers; and a composition comprising a first oligonucleotide 
primer 16 to 35 nucleobases in length comprising between 70% to 100% sequence identity of 
SEQ ID NO: 6, and a second oligonucleotide primer 15 to 35 nucleobases in length comprising 
between 70% to 100% sequence identity of SEQ ID NO: 369. 
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[0024] The present invention also provides an oligonucleotide primer 16 to 35 nucleobases in 
length comprising 70% to 100% sequence identity with SEQ ID NO: 246, or a composition 
comprising the same; an oligonucleotide primer 19 to 35 nucleobases in length comprising 70% 
to 100% sequence identity with SEQ ID NO: 602, or a composition comprising the same; a 
composition comprising both primers; and a composition comprising a first oligonucleotide 
primer 16 to 35 nucleobases in length comprising between 70% to 100% sequence identity of 
SEQ ID NO: 246, and a second oligonucleotide primer 19 to 35 nucleobases in length 
comprising between 70% to 100% sequence identity of SEQ ID NO: 602. 

[0025] The present invention also provides an oligonucleotide primer 21 to 35 nucleobases in 
length comprising 70% to 100% sequence identity with SEQ ID NO: 256, or a composition 
comprising the same; an oligonucleotide primer 14 to 35 nucleobases in length comprising 70% 
to 100% sequence identity with SEQ ID NO: 620, or a composition comprising the same; a 
composition comprising both primers; and a composition comprising a first oligonucleotide 
primer 21 to 35 nucleobases in length comprising between 70% to 100% sequence identity of 
SEQ ID NO: 256, and a second oligonucleotide primer 14 to 35 nucleobases in length 
comprising between 70% to 100% sequence identity of SEQ ID NO: 620. 

[0026] The present invention also provides an oligonucleotide primer 16 to 35 nucleobases in 
length comprising 70% to 100% sequence identity with SEQ ID NO: 344, or a composition 
comprising the same; an oligonucleotide primer 18 to 35 nucleobases in length comprising 70% 
to 100% sequence identity with SEQ ID NO: 700, or a composition comprising the same; a 
composition comprising both primers; and a composition comprising a first oligonucleotide 
primer 16 to 35 nucleobases in length comprising between 70% to 100% sequence identity of 
SEQ ID NO: 344, and a second oligonucleotide primer 18 to 35 nucleobases in length 
comprising between 70% to 100% sequence identity of SEQ ID NO: 700. 

[0027] The present invention also provides an oligonucleotide primer 16 to 35 nucleobases in 
length comprising 70% to 100% sequence identity with SEQ ID NO: 235, or a composition 
comprising the same; an oligonucleotide primer 16 to 35 nucleobases in length comprising 70% 
to 100% sequence identity with SEQ ID NO: 587, or a composition comprising the same; a 
composition comprising both primers; and a composition comprising a first oligonucleotide 
primer 16 to 35 nucleobases in length comprising between 70% to 100% sequence identity of 
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SEQ ID NO: 235, and a second oligonucleotide primer 16 to 35 nucleobases in length 
comprising between 70% to 100% sequence identity of SEQ ID NO: 587. 

[0028] The present invention also provides an oligonucleotide primer 16 to 35 nucleobases in 
length comprising 70% to 100% sequence identity with SEQ ID NO: 322, or a composition 
comprising the same; an oligonucleotide primer 19 to 35 nucleobases in length comprising 70% 
to 100% sequence identity with SEQ ID NO: 686, or a composition comprising the same; a 
composition comprising both primers; and a composition comprising a first oligonucleotide 
primer 16 to 35 nucleobases in length comprising between 70% to 100% sequence identity of 
SEQ ID NO: 322, and a second oligonucleotide primer 19 to 35 nucleobases in length 
comprising between 70% to 100% sequence identity of SEQ ID NO: 686. 

[0029] The present invention also provides compositions, such as those described herein, 
wherein either or both of the first and second oligonucleotide primers comprise at least one 
modified nucleobase, a non-tcmplated T residue on the 5'-end, at least one non-template tag, or 
at least one molecular mass modifying tag, or any combination thereof. 

[0030] The present invention also provides kits comprising any of the compositions described 
herein. The kits can comprise at least one calibration polynucleotide, or at least one ion exchange 
resin linked to magnetic beads, or both. 

[0031] The present invention also provides methods for identification of an unknown bacterium. 
Nucleic acid from the bacterium is amplified using any of the compositions described herein to 
obtain an amplification product. The molecular mass of the amplification product is determined. 
Optionally, the base composition of the amplification product is determined from the molecular 
mass. The base composition or molecular mass is compared with a plurality of base 
compositions or molecular masses of known bacterial bioagent identifying amplicons, wherein a 
match between the base composition or molecular mass and a member of the plurality of base 
compositions or molecular masses identifies the unknown bacterium. The molecular mass can be 
measured by mass spectrometry. In addition, the prrsence or absence of a particular clade, genus, 
species, or sub-species of a bioagent can be determined by the methods described herein. 

[0032] The present invention also provides methods for determination of the quantity of an 
unknown bacterium in a sample. The sample is contacted with any of the compositions described 
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herein and a known quantity of a calibration polynucleotide comprising a calibration sequence. 
Concurrently, nucleic acid from the bacterium in the sample is amplified with any of the 
compositions described herein and nucleic acid from the calibration polynucleotide in the sample 
is amplified with any of the compositions described herein to obtain a first amplification product 
comprising a bacterial bioagent identifying amplicon and a second amplification product 
comprising a calibration amplicon. The molecular mass and abundance for the bacterial bioagent 
identifying amplicon and the calibration amplicon is determined. The bacterial bioagent 
identifying amplicon is distinguished from the calibration amplicon based on molecular mass, 
wherein comparison of bacterial bioagent identifying amplicon abundance and calibration 
amplicon abundance indicates the quantity of bacterium in the sample. The method can also 
comprise determining the base composition of the bacterial bioagent identifying amplicon. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0033] Figure 1 is a represenataive pseudo-four dimensional plot of base compositions of 
bioagent identifying amplicons of enterobacteria obtained with a primer pair targeting the rpoB 
gene (primer pair no 14 (SEQ ID NOs: 37:362). The quantity each of the nucleobases A, G and 
C are represented on the three axes of the plot while the quantity of nucleobase T is represented 
by the diameter of the spheres. Base composition probability clouds surrounding the spheres are 
also shown. 

[0034] Figure 2 is a represenataive diagram illustrating the primer selection process. 

[0035] Figure 3 lists common pathogenic bacteria and primer pair coverage. The primer pair 
number in the upper right hand corner of each polygon indicates that the primer pair can produce 
a bioagent identifying amplicon for all species within that polygon. 

[0036] Figure 4 is a represenataive 3D diagram of base composition (axes A, G and C) of 
bioagent identifying amplicons obtained with primer pair number 14 (a precursor of primer pair 
number 348 which targets 16S rRNA). The diagram indicates that the experimentally determined 
base compositions of the clinical samples (labeled NHRC samples) closely match the base 
compositions expected for Streptococcus pyogenes and are distinct from the expected base 
compositions of other organisms. 
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[0037] Figure 5 is a represenataive mass spectrum of amplification products representing 
bioagent identifying amplicons of Streptococcus pyogenes, Neisseria meningitidis, and 
Haemophilus influenzae obtained from amplification of nucleic acid from a clinical sample with 
primer pair number 349 which targets 23S rRNA. Experimentally determined molecular masses 
and base compositions for the sense strand of each amplification product are shown. 

[0038] Figure 6 is a represenataive mass spectrum of amplification products representing a 
bioagent identifying amplicon of Streptococcus pyogenes, and a calibration amplicon obtained 
from amplification of nucleic acid from a clinical sample with primer pair number 356 which 
targets rplB. The experimentally determined molecular mass and base composition for the sense 
strand of the Streptococcus pyogenes amplification product is shown. 

[0039] Figure 7 is a represenataive process diagram for identification and determination of the 
quantity of a bioagent in a sample. 

[0040] Figure 8 is a represenataive mass spectrum of an amplified nucleic acid mixture which 
contained the Arnes strain of Bacillus anthracis, a known quantity of combination calibration 
polynucleotide (SEQ ID NO: 741), and primer pair number 350 which targets the capC gene on 
the virulence plasmid pX02 of Bacillus anthracis. Calibration amplicons produced in the 
amplification reaction are visible in the mass spectrum as indicated and abundance data (peak 
height) are used to calculate the quantity of the Ames strain of Bacillus anthracis. 

DESCRIPTION OF EMBODIMENTS 

[0041] The present invention provides oligonucleotide primers which hybridize to conserved 
regions of nucleic acid of genes encoding, for example, proteins or RNAs necessary for life 
which include, but are not limited to: 16S and 23S rRNAs, RNA polymerase subunits, t-RNA 
synthetases, elongation factors, ribosomal proteins, protein chain initiation factors, cell division 
proteins, chaperorrin groEL, chaperonin dnaK, phosphoglycerate kinase, NADH dehydrogenase, 
DNA ligases, metabolic enzymes and DNA topoisomerases. These primers provide the 
functionality of producing, for example, bacterial bioagent identifying amplicons for general 
identification of bacteria at the species level, for example, when contacted with bacterial nucleic 
acid under amplification conditions. 
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[0042] Referring to Figure 2, primers are designed as follows: for each group of organisms, 
candidate target sequences are identified (200) from which nucleotide alignments are created 
(210) and analyzed (220). Primers are designed by selecting appropriate priming regions (230) 
which allows the selection of candidate primer pairs (240). The primer pairs are subjected to in 
silico analysis by electronic PGR (ePCR) (300) wherein bioagent identifying amplicons are 
obtained from sequence databases such as, for example, GenBank or other sequence collections 
(310), and checked for specificity in silico (320). Bioagent identifying amplicons obtained from 
GenBank sequences (310) can also be analyzed by a probability model which predicts the 
capability of a particular amplicon to identify unknown bioagents such that the base 
compositions of amplicons smth. favorable probability scores are stored in a base composition 
database (325). Alternatively, base compositions of the bioagent identifying amplicons obtained 
from the primers and GenBank sequences can be directly entered into the base composition 
database (330). Candidate primer pairs (240) are validated by in vitro amplification by a method 
such as, for example, PCR analysis (400) of nucleic acid from a collection of organisms (410). 
Amplification products that are obtained are optionally analyzed to confirm the sensitivity, 
specificity and reproducibility of the primers used to obtain the amplification products (420). 

[0043] Synthesis of primers is well known and routine in the art. The primers may be 
conveniently and routinely made through the well-known technique of solid phase synthesis. 
Equipment for such synthesis is sold by several vendors including, for example, Applied 
Biosystems (Foster City, CA). Any other means for such synthesis known in the art may 
additionally or alternatively be employed. 

[0044] The primers can be employed as compositions for use in, for example, methods for 
identification of bacterial bioagents as follows. In some embodiments, a primer pair composition 
is contacted with nucleic acid of an unknown bacterial bioagent. The nucleic acid is amplified by 
a nucleic acid amplification technique, such as PCR for example, to obtain an amplification 
product that represents a bioagent identifying amplicon. The molecular mass of one strand or 
each strand of the double-stranded amplification product is determined by a molecular mass 
measurement technique sucli as, for example, mass spectrometry wherein the two strands of the 
double-stranded amplification product are separated during the ionization process. In some 
embodiments, the mass spectrometry is electrospray Fourier transform ion cyclotron resonance 
mass spectrometry (ESI-FTICR-MS) or electrospray time of flight mass spectrometry (ESI-TOF- 
MS). A list of possible base compositions can be generated for the molecular mass value 
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obtained for each strand and the choice of the correct base composition from the list is facilitated 
by matching the base composition of one strand with a complementary base composition of the 
other strand. The molecular mass or base composition thus determined is compared with a 
database of molecular masses or base compositions of analogous bioagent identifying amplicons 
for known bacterial bioagents. A match between the molecular mass or base composition of the 
amplification product from the unknown bacterial bioagent and the molecular mass or base 
composition of an analogous bioagent identifying amplicon for a known bacterial bioagent 
indicates the identity of the unknown bioagent. 

[0045] In some embodiments, the primer pair used is one of the primer pairs of Table 1. In some 
embodiments, the method is repeated using a different primer pair to resolve possible 
ambiguities in the identification process or to improve the confidence level for the identification 
assignment. 

[0046] In some embodiments, a bioagent identifying amplicon may be produced using only a 
single primer (either the forward or reverse primer of any given primer pair), provided an 
appropriate amplification method is chosen, such as, for example, low stringency single primer 
PCR (LSSP-PCR). Adaptation of this ampl ification method in order to produce bioagent 
identifying amplicons can be accomplished by one with ordinary skill in the art without undue 
experimentation. 

[0047] In some embodiments, the oligonucleotide primers are "broad range survey primers" 
which hybridize to conserved regions of nucleic acid encoding RNA, such as ribosomal RNA 
(rRNA), of all, or at least 70%, at least 80%, at least 85%, at least 90%, or at least 95% of known 
bacteria and produce bacterial bioagent identifying amplicons. As used herein, the term "broad 
range survey primers" refers to primers that bind to nucleic acid encoding rRNAs of all, or at 
least 70%, at least 80%, at least 85%, at least 90%, or at least 95% known species of bacteria. In 
some embodiments, the rRNAs to which the primers hybridize are 16S and 23S rRNAs. In some 
embodiments, the broad range survey primer pairs comprise oligonucleotides ranging in length 
from 13 to 35 nucleobases, each of which have from 70% to 100% sequence identity with primer 
pair numbers 3, 10, 1 1, 14, 16, and 17 which consecutively correspond to SEQ ID NOs: 6:369, 
26:388, 29:391, 37:362, 48:404, and 58:414. 
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[0048] In some cases, the molecular mass or base composition of a bacterial bioagent identifying 
amplicon defined by a broad range survey primer pair does not provide enough resolution to 
unambiguously identify a bacterial bioagent at the species level. These cases benefit from further 
analysis of one or more bacterial bioagent identifying amplicons generated from at least one 
additional broad range survey primer pair or from at least one additional "division-wide" primer 
pair (vide infra). The employment of more than one bioagent identifying amplicon for 
identification of a bioagent is herein referred to as "triangulation identification" (vide infra). 

[0049] In other embodiments, the oligonucleotide primers are "division-wide" primers which 
hybridize to nucleic acid encoding genes of broad divisions of bacteria such as, for example, 
members of the Bacillus/Clostridia group or members of the a-, p-, y-, and e-proteobacteria. In 
some embodiments, a division of bacteria comprises any grouping of bacterial genera with more 
than one genus represented. For example, the P-proteobacteria group comprises members of the 
following genera: Eikenella, Neisseria, Achromobacter, Bordetella, Burkholderia, and Raltsonia. 
Species members of these genera can be identified using "bacterial bioagent identifying amplicons 
generated with primer pair 293 (SEQ ID NOs: 344:700) which produces a bacterial bioagent 
identifying amplicon from the tuffi gene of p-proteobacteria. Examples of genes to which 
division-wide primers may hybridize to include, but are n ot limited to: RNA polymerase 
subunits such as rpoB and rpoC, tRNA synthetases such as valyl-tRNA synthetase (valS) and 
aspartyl-tRNA synthetase (aspS), elongation factors such as elongation factor EF-Tu (tufB), 
ribosomal proteins such as ribosomal protein L2 (rplB), protein chain initiation factors such as 
protein chain initiation factor infB, chaperonins such as grroL and dnaK, and cell division 
proteins such as peptidase ftsH (hflB). In some embodiments, the division-wide primer pairs 
comprise oligonucleotides ranging in length from 13 to 35 nucleobases, each of which have from 
70% to 100% sequence identity with primer pair numbers 34, 52, 66, 67, 71, 72, 289, 290 and 
293 which consecutively correspond to SEQ ID NOs: 160:515, 261:624, 231:591, 235:587, 
349:711, 240:596, 246:602, 256:620, 344:700. 

[0050] In other embodiments, the oligonucleotide primers are designed to enable the 
identification of bacteria at the clade group level, which is a monophyletic taxon referring to a 
group of organisms which includes the most recent common ancestor of all of its members and 
all of the descendants of that most recent common ancestor. The Bacillus cereus clade is an 
example of a bacterial clade group. In some embodiments, the clade group primer pairs comprise 
oligonucleotides ranging in length from 13 to 35 nucleobases, each of which have from 70% to 
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100% sequence identity with primer pair number 58 which corresponds to SEQ ID NOs: 
322:686. 

[0051] In other embodiments, the oligonucleotide primers aire "drill-down" primers which enable 
the identification of species or "sub-species characteristics." Sub-species characteristics are 
herein defined as genetic characteristics that provide the means to distinguish two members of 
the same bacterial species. For example, Escherichia coli OL57:H7 and Escherichia coli K12 are 
two well known members of the species Escherichia coli. Escherichia coli 0157:H7, however, is 
highly toxic due to the its Shiga toxin gene which is an example of a sub-species characteristic. 
Examples of sub-species characteristics may also include, bmt are not limited to: variations in 
genes such as single nucleotide polymorphisms (SNPs), variable number tandem repeats 
(VNTRs). Examples of genes indicating sub-species characteristics include, but are not limited 
to, housekeeping genes, toxin genes, pathogenicity markers, antibiotic resistance genes and 
virulence factors. Drill-down primers provide the functionality of producing bacterial bioagent 
identifying amplicons for drill-down analyses such as strain typing when contacted with bacterial 
nucleic acid under amplification conditions. Identification of such sub-species characteristics is 
often critical for determining proper clinical treatment of bacterial infections. Examples of pairs 
of drill-down primers include, but are not limited to, a trio of primer pairs for identification of 
strains of Bacillus anthracis. Primer pair 24 (SEQ ID NOs: 97:451) targets the capC gene of 
virulence plasmid pX02, primer pair 30 (SEQ ID NOs: 127:482) targets the cyA gene of 
virulence plasmid pX02, and primer pair 37 (SEQ ID NOs: 174:530) targets the lef gene of 
virulence plasmid pX02. Additional examples of drill-down primers include, but are not limited 
to, six primer pairs that are used for determining the strain type of group A Streptococcus. 
Primer pair 80 (SEQ ID NOs: 310:668) targets the gki gene, primer pair 81 (SEQ ID NOs: 
3 1 3 :670) targets the gtr gene, primer pair 86 (SEQ ID NOs z 227:632) targets the murl gene, 
primer pair 90 (SEQ ID NOs: 285:640) targets the mutS gene, primer pair 96 (SEQ ID NOs: 
301 :656) targets the xpt gene, and primer pair 98 (SEQ ID NTOs: 308:663) targets the yqiL gene. 

[0052] In some embodiments, the primers used for amplification hybridize to and amplify 
genomic DNA, DNA of bacterial plasmids, or DNA of DNA. viruses. 

[0053] In some embodiments, the primers used for amplification hybridize directly to ribosomal 
RNA or messenger RNA (mRNA) and act as reverse transcription primers for obtaining DNA 
from direct amplification of bacterial RNA or rRNA. Methods of amplifying RNA using reverse 
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transcriptase are well known to those with ordinary skill in the art and can be routinely 
established without undue experimentation. 

[0054] One with ordinary skill in the art of design of amplification primers will recognize that a 
given primer need not hybridize with 100% complementarity in order to effectively prime the 
synthesis of a complementary nucleic acid strand in an amplification reaction. Moreover, a 
primer may hybridize over one or more segments such that intervening or adjacent segments are 
not involved in the hybridization event (e.g., a loop structure or a hairpin structure). The primers 
of the present invention may comprise at least 70%, at least 75%, at least 80%, at least 85%, at 
least 90%, at least 95% or at least 99% sequence identity with any of the primers listed in Table 
1. Thus, in some embodiments of the present invention, an extent of variation of 70% to 100%, 
or any range therewithin, of the sequence identity is possible relative to the specific primer 
sequences disclosed herein. Determination of sequence identity is described in the following 
example: a primer 20 nucleobases in length which is otherwise identical to another 20 
nucleobase primer but having two non-identical residues has 1 8 of 20 identical residues (18/20 = 
0.9 or 90% sequence identity). In another example, a primer 15 nucleobases in length having all 
residues identical to a 15 nucleobase segment of primer 20 nucleobases in length would have 
15/20 = 0.75 or 75% sequence identity with the 20 nucleobase primer. 

[0055] Percent homology, sequence identity or complementarity, can be determined by, for 
example, the Gap program (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics 
Computer Group, University Research Park, Madison WI), using default settings, which uses the 
algorithm of Smith and Waterman (Adv. Appl. Math., 1981, 2, 482-489). In some embodiments, 
homology, sequence identity, or complementarity of primers with respect to "the conserved 
priming regions of bacterial nucleic acid, is at least 70%, at least 80%, at least 90%, at least 92%, 
at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or is 100%. 

[0056] In some embodiments, the primers described herein comprise at least 70%, at least 75%, 
at least 80%, at least 85%, at least 90%, at least 92%, at least 94%, at least 95%, at least 96%, at 
least 98%, or at least 99%, or 100% (or any range therewithin) sequence identity with the primer 
sequences specifically disclosed herein. Thus, for example, a primer may hax/c between 70% and 
100%, between 75% and 100%, between 80% and 100%, and between 95% and 100% sequence 
identity with SEQ ID NO: 26. Likewise, a primer may have similar sequence identity with any 
other primer whose nucleotide sequence is disclosed herein. 
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[0057] One with ordinary skill is able to calculate percent sequence identity or percent sequence 
homology and able to determine, without undue experimentation, the effects of variation of 
primer sequence identity on the function of the primer in its role in priming synthesis of a 
complementary strand of nucleic acid for production of an amplification product of a 
corresponding bioagent identifying amplicon. 

[0058] In some embodiments of the present invention, the oligonucleotide primers are between 
13 and 35 nucleobases in length (13 to 35 linked nucleotide residues). These embodiments 
comprise oligonucleotide primers 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 
30, 31, 32, 33, 34 or 35 nucleobases in length, or any range therewithin. 

[0059] In some embodiments, any given primer comprises a modification comprising the 
addition of a non-templated T residue to the 5' end of the primer (i.e., the added T residue does 
not necessarily hybridize to the nucleic acid being amplified). The addition of a non-templated T 
residue has an effect of minimizing the addition of non-templated A residues as a result of the 
non-specific enzyme activity of Taq polymerase (Magnuson et al. Biotechniques, 1996, 21, 700- 
709), an occurrence which may lead to ambiguous results arising from molecular mass analysis. 

[0060] In some embodiments of the present invention, primers may contain one or more 
universal bases. Because any variation (due to codon wobble in the 3 rd position) in the conserved 
regions among species is likely to occur in the third position of a DNA triplet, oligonucleotide 
primers can be designed such that the nucleotide corresponding to this position is a base whiich 
can bind to more than one nucleotide, referred to herein as a "universal nucleobase." For 
example, under this "wobble" pairing, inosine (I) binds to U, C or A; guanine (G) binds to XJ or 
C, and uridine (U) binds to U or C. Other examples of universal nucleobases include nitroindolcs 
such as 5-nitroindole or 3-nitropyrrole (Loakes et al., Nucleosides and Nucleotides, 1995, L 4, 
1001-1003), the degenerate nucleotides dP or dK (Hill et al.), an acyclic nucleoside analog 
containing 5-nitroindazole (Van Aerschot et al., Nucleosides and Nucleotides, 1995, 14, 1053- 
1056) or the purine analog l-(2-deoxy-(3-D-ribofuranosyl)-imidazole-4-carboxamide (Sala et al., 
Nucl. Acids Res., 1996, 24, 3302-3306). 

[0061] In some embodiments, to compensate for the somewhat weaker binding by the "wobble" 
base, the oligonucleotide primers are designed such that the first and second positions of eaxh 
triplet are occupied by nucleotide analogs which bind with greater affinity than the unmodified 
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nucleotide. Examples of these analogs include, but are not limited to, 2,6-diaminopurine which 
binds to thymine, 5-propynyluracil which binds to adenine and 5-propynylcytosine and 
phenoxazines, including G-clamp, which binds to G. Propynylated pyrimidines are described in 
U.S. Patent Nos. 5,645,985, 5,830,653 and 5,484,908, each of which is commonly owned and 
incorporated herein by reference in its entirety. Propynylated primers are described in U.S Serial 
No. 10/294,203 which is also commonly owned and incorporated herein by reference in entirety. 
Phenoxazines are described in U.S. Patent Nos. 5,502,177, 5,763,588, and 6,005,096, each of 
which is incorporated herein by reference in its entirety. G-clamps are described in U.S. Patent 
Nos. 6,007,992 and 6,028,183, each of which is incorporated herein by reference in its entirety. 

[0062] In some embodiments, non-template primer tags are used to increase the melting 
temperature (T m ) of a primer-template duplex in order to improve amplification efficiency. A 
non-template tag is at least three consecutive A or T nucleotide residues on a primer which are 
not complementary to the template. In any given non-template tag, A can be replaced by C or G 
and T can also be replaced by C or G. Although Watson-Crick hybridization is not expected to 
occur for a non-template tag relative to the template, the extra hydrogen bond in a G-C pair 
relative to a A-T pair confers increased stability of the primer-template duplex and improves 
amplification efficiency for subsequent cycles of amplification when the primers hybridize to 
strands synthesized in previous cycles. 

[0063] In other embodiments, propynylated tags may be used in a manner similar to that of the 
non-template tag, wherein two or more 5-propynylcytidine or 5-propynyluridine residues replace 
template matching residues on a primer. In other embodiments, a primer contains a modified 
internucleoside linkage such as a phosphorothioate linkage, for example. 

[0064] In some embodiments, the primers contain mass-modifying tags. Reducing the total 
number of possible base compositions of a nucleic acid of specific molecular weight provides a 
means of avoiding a persistent source of ambiguity in determination of base composition of 
amplification products. Addition of mass-modifying tags to certain nucleobases of a given 
primer will result in simplification of de novo determination of base composition of a given 
bioagent identifying amplicon {vide infra) from its molecular mass. 

[0065] In some embodiments of the present invention, the mass modified nucleobase comprises 
one or more of the following: for example, 7-deaza-2'-deoxyadenosine-5-triphosphate, 5-iodo-2'- 
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deoxyuridine-5 '-triphosphate, 5 -bromo-2'-deoxyuridine-5'-triphosphate, 5 -bromo-2'- 
deoxycytidine-5'-triphosphate, 5-iodo-2'-deoxycytidine-5'-1riphosphate, 5-hydroxy-2'- 
deoxyuridine-5 '-triphosphate, 4-thiothymidine-5'-triphosphate, 5-aza-2'-deoxyuridine-5'- 
triphosphate, 5-fluoro-2'-deoxyuridine-5'-triphosphate, 06-methyl-2'-deoxyguanosine-5'- 
triphosphate,N2-methyl-2'-deoxyguanosine-5 '-triphosphate, 8-oxo-2'-deoxyguanosine-5'- 
triphosphate or thiothymidine-5'-triphosphate. In some embodiments, the mass-modified 
nucleobase comprises 15 N or 13 C or both 15 N and 13 C. 

[0066] In some embodiments of the present invention, at least one bacterial nucleic acid segment 
is amplified in the process of identifying the bioagent. Thus, the nucleic acid segments that can 
be amplified by the primers disclosed herein and that provide enough variability to distinguish 
each individual bioagent and whose molecular masses are amenable to molecular mass 
determination are herein described as "bioagent identifying amplicons." The term "amplicon" as 
used herein, refers to a segment of a polynucleotide which is amplified in an amplification 
reaction. In some embodiments of the present invention, bioagent identifying amplicons 
comprise from about 45 to about 200 nucleobases (i.e. from about 45 to about 200 linked 
nucleosides), from about 60 to about 150 nucleobases, from about 75 to about 125 nucleobases. 
One of ordinary skill in the art will appreciate that the invention embodies compounds of 45, 46, 
47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 
73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 
99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 
118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 
137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 
156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 
175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 
194, 195, 196, 197, 198, 199, and 200 nucleobases in length, or any range therewithin. It is the 
combination of the portions of the bioagent nucleic acid segment to which the primers hybridize 
(hybridization sites) and the variable region between the primer hybridization sites that 
comprises the bioagent identifying amplicon. Since genetic data provide the underlying basis for 
identification of bioagents by the methods of the present invention, it is prudent to select 
segments of nucleic acids which ideally provide enough variability to distinguish each individual 
bioagent and whose molecular mass is amenable to molecular mass determination. 
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[0067] In some embodiments, bioagent identifying amplicons amenable to molecular mass 
determination which are produced by the primers described herein are either of a length, size or 
mass compatible with the particular mode of molecular mass determination or compatible with a 
means of providing a predictable fragmentation pattern in order to obtain predictable fragments 
of a length compatible with the particular mode of molecular mass determination. Such means 
of providing a predictable fragmentation pattern of an amplification product include, but are not 
limited to, cleavage with restriction enzymes or cleavage primers, for example. Methods of using 
restriction enzymes and cleavage primers are well known to those with ordinary skill in the art. 

[0068] In some embodiments, amplification products corresponding to bacterial bioagent 
identifying amplicons are obtained using the polymerase chain reaction (PGR) which is a routine 
method to those with ordinary skill in the molecular biology arts. Other amplification methods 
may be used such as ligase chain reaction (LCR), low-stringency single primer PGR, and 
multiple strand displacement amplification (MDA) which are also well known to those with 
ordinary skill. 

[0069] In the context of this invention, a "bioagent" is any organism, cell, or virus, living or 
dead, or a nucleic acid derived from such an organism, cell or virus. Examples of bioagents 
include, but are not limited, to cells, (including but not limited to human clinical samples, 
bacterial cells and other pathogens), viruses, fungi, protists, parasites, and pathogenicity markers 
(including but not limited to: pathogenicity islands, antibiotic resistance genes, virulence factors, 
toxin genes and other bioregulating compounds). Samples may be alive or dead or in a 
vegetative state (for example, vegetative bacteria or spores) and may be encapsulated or 
bioengineered. In the context of this invention, a "pathogen" is a bioagent which causes a disease 
or disorder. 

[0070] In the context of this invention, the term "unknown bioagent" may mean either: (i) a 
bioagent whose existence is known (such as the well known bacterial species Staphylococcus 
aureus for example) but which is not known to be in a sample to be analyzed, or (ii) a bioagent 
whose existence is not known (for example, the SARS coronavirus was unknown prior to April 
2003). For example, if the method for identification of coronaviruses disclosed in commonly 
owned U.S. Patent Serial No. 10/829,826 (incorporated herein by reference in its entirety) was to 
be employed prior to April 2003 to identify the SARS coronavirus in a clinical sample, both 
meanings of "unknown" bioagent are applicable since the SARS coronavirus was unknown to 
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science prior to April, 2003 and since it was not known what bioagent (in this case a 
coronavirus) was present in the sample. On the other hand, if the method of U.S. Patent Serial 
No. 10/829,826 was to be employed subsequent to April 2003 to identify the SARS coronavirus 
in a clinical sample, only the first meaning (i) of "unknown" bioagent would apply since the 
SARS coronavirus became known to science subsequent to April 2003 and since it was not 
known what bioagent was present in the sample. 

[0071] The employment of more than one bioagent identifying amplicon for identification of a 
bioagent is herein referred to as "triangulation identification." Triangulation identification is 
pursued by analyzing a plurality of bioagent identifying amplicons selected within multiple core 
genes. This process is used to reduce false negative and false positive signals, and enable 
reconstruction of the origin of hybrid or otherwise engineered bioagents. For example, 
identification of the three part toxin genes typical of B. anthracis (Bowen et al., J. Appl. 
Microbiol, 1999, 87, 270-278) in the absence of the expected signatures from the B. anthracis 
genome would suggest a genetic engineering event. 

[0072] In some embodiments, the triangulation identification process can be pursued by 
characterization of bioagent identifying amplicons in a massively parallel fashion using the 
polymerase chain reaction (PCR), such as multiplex PCR where multiple primers are employed 
in the same amplification reaction mixture, or PCR in multi-well plate fonnat wherein a different 
and unique pair of primers is used in multiple wells containing otherwise identical reaction 
mixtures. Such multiplex and multi-well PCR methods are well known to those with ordinary 
skill in the arts of rapid throughput amplification of nucleic acids. 

[0073] In some embodiments, the molecular mass of a particular bioagent identifying amplicon 
is determined by mass spectrometry. Mass spectrometry has several advantages, not the least of 
which is high bandwidth characterized by the ability to separate (and isolate) many molecular 
peaks across a broad range of mass to charge ratio (m/z). Thus, mass spectrometry is intrinsically 
a parallel detection scheme without the need for radioactive or fluorescent labels, since every 
amplification product is identified by its molecular mass. The current state of the art in mass 
spectrometry is such that less than femtomole quantities of material can be readily analyzed to 
afford information about the molecular contents of the sample. An accurate assessment of the 
molecular mass of the material can be quickly obtained, irrespective of whether the molecular 
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weight of the sample is several hundred, or in excess of one hundred thousand atomic mass units 
(amu) or Daltons. 

[0074] In some embodiments, intact molecular ions are generated from amplification products 
using one of a variety of ionization techniques to convert the sample to gas phase. These 
ionization methods include, but are not limited to, electrospray ionization (ES), matrix-assisted 
laser desorption ionization (MALDI) and fast atom bombardment (FAB). Upon ionization, 
several peaks are observed from one sample due to the formation of ions with different charges. 
Averaging the multiple readings of molecular mass obtained from a single mass spectrum affords 
an estimate of molecular mass of the bioagent identifying amplicon. Electrospray ionization 
mass spectrometry (ESI-MS) is particularly useful for very high molecular weight polymers such 
as proteins and nucleic acids having molecular weights greater than 10 kDa, since it yields a 
distribution of multiply-charged molecules of the sample without causing a significant amount of 
fragmentation. 

[0075] The mass detectors used in the methods of the present invention include, but are not 
limited to, Fourier transform ion cyclotron resonance mass spectrometry (FT-ICR-MS), time of 
flight (TOF), ion trap, quadrupole, magnetic sector, Q-TOF, and triple quadrupole. 

[0076] In some embodiments, conversion of molecular mass data to a base composition is useful 
for certain analyses. As used herein, a "base composition" is the exact number of each 
nucleobase (A, T, C and G). For example, amplification of nucleic acid of Neisseria meningitidis 
with a primer pair that produces an amplification product from nucleic acid of 23 S rRNA that 
has a molecular mass (sense strand) of 28480.75124, from which a base composition of A25 G27 
C22 T18 is assigned from a list of possible base compositions calculated from the molecular 
mass using standard known molecular masses of each of the four nucleobases. 

[0077] In some embodiments, assignment of base compositions to experimentally determined 
molecular masses is accomplished using "base composition probability clouds." Base 
compositions, like sequences, vary slightly from isolate to isolate within species. It is possible to 
manage this diversity by building "base composition probability clouds" around the composition 
constraints for each species. This permits identification of organisms in a fashion similar to 
sequence analysis. A "pseudo four-dimensional plot" (Figure 1) can be used to visualize the 
concept of base composition probability clouds. Optimal primer design requires optimal choice 
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of bioagent identifying amplicons and maximizes the separation between the base composition 
signatures of individual bioagents. Areas where clouds overlap indicate regions that may result 
in a misclassification, a problem which is overcome by a triangulation identification process 
using bioagent identifying amplicons not affected by overlap of base composition probability 
clouds. 

[0078] In some embodiments, base composition probability clouds provide the means for 
screening potential primer pairs in order to avoid potential misclassifications of base 
compositions. In other embodiments, base composition probability clouds provide the means for 
predicting the identity of a bioagent whose assigned base composition was not previously 
observed and/or indexed in a bioagent identifying amplicon base composition database due to 
evolutionary transitions in its nucleic acid sequence. Thus, in contrast to probe-based techniques, 
mass spectrometry determination of base composition does not require prior knowledge of the 
composition or sequence in order to make the measurement. 

[0079] The present invention provides bioagent classifying information similar to DNA 
sequencing and phylogenetic analysis at a level sufficient to identify a given bioagent. 
Furthermore, the process of determination of a previously unknown base composition for a given 
bioagent (for example, in a case where sequence information is unavailable) has downstream 
utility by providing additional bioagent indexing information with which to populate base 
composition databases. The process of future bioagent identification is thus greatly improved as 
more BCS indexes become available in base composition databases. 

[0080] In one embodiment, a sample comprising an unknown bioagent is contacted with a pair 
of primers which provide the means for amplification of nucleic acid from the bioagent, and a 
known quantity of a polynucleotide that comprises a calibration sequence. The nucleic acids of 
the bioagent and of the calibration sequence are amplified and the rate of amplification is 
reasonably assumed to be similar for the nucleic acid of the bioagent and of the calibration 
sequence. The amplification reaction then produces two amplification products: a bioagent 
identifying amplicon and a calibration amplicon. The bioagent identifying amplicon and the 
calibration amplicon should be distinguishable by molecular mass while being amplified at 
essentially the same rate. Effecting differential molecular masses can be accomplished by 
choosing as a calibration sequence, a representative bioagent identifying amplicon (from a 
specific species of bioagent) and performing, for example, a 2 to 8 nucleobase deletion or 
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insertion within the variable region between the two priming sites. The amplified sample 
containing the bioagent identifying amplicon and the calibration amplicon is then subjected to 
molecular mass analysis by mass spectrometry, for example. The resulting molecular mass 
analysis of the nucleic acid of the bioagent and of the calibration sequence provides molecular 
mass data and abundance data for the nucleic acid of the bioagent and of the calibration 
sequence. The molecular mass data obtained for the nucleic acid of the bioagent enables 
identification of the unknown bioagent and the abundance data enables calculation of the 
quantity of the bioagent, based on the knowledge of the quantity of calibration polynucleotide 
contacted with the sample. 

[0081] In some embodiments, the identity and quantity of a particular bioagent is determined 
using the process illustrated in Figure 7. For instance, to a sample containing nucleic acid of an 
unknown bioagent are added primers (500) and a known quantity of a calibration polynucleotide 
(505). The total nucleic acid in the sample is subjected to an amplification reaction (510) to 
obtain amplification products. The molecular masses of amplification products are determined 
(515) from which are obtained molecular mass and abundance data. The molecular mass of the 
bioagent identifying amplicon (520) provides the means for its identification (525) and the 
molecular mass of the calibration amplicon obtained from the calibration polynucleotide (530) 
provides the means for its identification (535). The abundance data of the bioagent identifying 
amplicon is recorded (540) and the abundance data for the calibration data is recorded (545), 
both of which are used in a calculation (550) which determines the quantity of unknown bioagent 
in the sample. 

[0082] In some embodiments, construction of a standard curve where the amount of calibration 
polynucleotide spiked into the sample is varied, provides additional resolution and improved 
confidence for the determination of the quantity of bioagent in the sample. The use of standard 
curves for analytical determination of molecular quantities is well known to one with ordinary 
skill and can be performed without undue experimentation. 

[0083] In some embodiments, multiplex amplification is performed where multiple bioagent 
identifying amplicons are amplified -with multiple primer pairs which also amplify the 
corresponding standard calibration sequences. In this or other embodiments, the standard 
calibration sequences are optionally included within a single vector which functions as the 



WO 2005/098047 PCT/US2005/005356 
-24- 

calibration polynucleotide. Multiplex amplification methods are well known to those with 
ordinary skill and can be performed without undue experimentation. 

[0084] In some embodiments, the calibrant polynucleotide is used as an internal positive control 
to confirm that amplification conditions and subsequent analysis steps are successful in 
producing a measurable amplicon. Even in the absence of copies of the genome of a bioagent, 
the calibration polynucleotide should give rise to a calibration amplicon. Failure to produce a 
measurable calibration amplicon indicates a failure of amplification or subsequent analysis step 
such as amplicon purification or molecular mass determination. Reaching a conclusion that such 
failures have occurred is in itself, a useful event. 

[0085] In some embodiments, the calibration sequence is inserted into a vector which then itself 
functions as the calibration polynucleotide. In some embodiments, more than one calibration 
sequence is inserted into the vector that functions as the calibration polynucleotide. Such a 
calibration polynucleotide is herein termed a "combination calibration polynucleotide." The 
process of inserting polynucleotides into vectors is routine to those skilled in the art and can be 
accomplished without undue experimentation. Thus, it should be recognized that the calibration 
method should not be limited to the embodiments described herein. The calibration method can 
be applied for determination of the quantity of any bioagent identifying amplicon when an 
appropriate standard calibrant polynucleotide sequence is designed and used. The process of 
choosing an appropriate vector for insertion of a calibrant is also a routine operation that can be 
accomplished by one with ordinary skill without undue experimentation. 

[0086] The present invention also provides kits for carrying out, for example, the methods 
described herein. In some embodiments, the kit may comprise a sufficient quantity of one or 
more primer pairs to perform an amplification reaction on a target polynucleotide from a 
bioagent to form a bioagent identifying amplicon. In some embodiments, the kit may comprise 
from one to fifty primer pairs, from one to twenty primer pairs, from one to ten primer pairs, or 
from two to five primer pairs. In some embodiments, the kit may comprise one or more primer 
pairs recited in Table 1 . 

[0087] In some embodiments, the kit may comprise one or more broad range survey primer(s), 
division wide primer(s), clade group primer(s) or drill-down primer(s), or any combination 
thereof. A kit may be designed so as to comprise particular primer pairs for identification of a 
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paxticular bioagent. For example, a broad range survey primer kit may be used initially to 
identify an unknown bioagent as a member of the Bacillus/Clostridia group. Another example of 
a division-wide kit may be used to distinguish Bacillus anthracis, Bacillus cereus and Bacillus 
thuringiensis from each other. A clade group primer kit may be used, for example, to identify an 
unknown bacterium as a member of the Bacillus cereus clade group. A drill-down kit may be 
used, for example, to identify genetically engineered Bacillus anthracis. In some embodiments, 
any of these kits may be combined to comprise a combination of broad range survey primers and 
division-wide primers, clade group primers or drill-down primers, or any combination thereof, 
for identification of an unknown bacterial bioagent. 

[0O88] In some embodiments, the kit may contain standardized calibration polynucleotides for 
use as internal amplification calibrants. Internal calibrants are described in commonly owned 
U.S. Patent Application Serial No: 60/545,425 which is incorporated herein by reference in its 
entirety. 

[0O89] In some embodiments, the kit may also comprise a sufficient quantity of reverse 
transcriptase (if an RNA virus is to be identified for example), a DNA polymerase, suitable 
nucleoside triphosphates (including any of those described above), a DNA ligase, and/or reaction 
buffer, or any combination thereof, for the amplification processes described above. A kit may 
further include instructions pertinent for the particular embodiment of the kit, such instructions 
describing the primer pairs and amplification conditions for operation of the method. A kit may 
also comprise amplification reaction containers such as microcentrifuge tubes and the like. A kit 
may also comprise reagents or other materials for isolating bioagent nucleic acid or bioagent 
identifying amplicons from amplification, including, for example, detergents, solvents, or ion 
exxhange resins which may be linked to magnetic beads. A kit may also comprise a table of 
measured or calculated molecular masses and/or base compositions of bioagents using the primer 
pairs of the kit. 

[0O90] In order that the invention disclosed herein may be more efficiently understood, examples 
are provided below. It should be understood that these examples are for illustrative purposes only 
arid are not to be construed as limiting the invention in any manner. Throughout these examples, 
molecular cloning reactions, and other standard recombinant DNA techniques, were carried out 
according to methods described in Maniatis et al., Molecular Cloning - A Laboratory Manual, 
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2nd ed., Cold Spring Harbor Press (1989), using commercially available reagents, except where 
otherwise noted. 

EXAMPLES 

[0091] Example 1: Selection of Primers That Define Bioagent Identifying Amplicons 
[0092] For design of primers that define bacterial bioagent identifying amplicons, relevant 
sequences from, for example, GenBank are obtained, aligned and scanned for regions where 
pairs of PCR primers would amplify products of about 45 to about 200 nucleotides in length and 
distinguish species from each other by their molecular masses or base compositions. A typical 
process shown in Figure 2 is employed. 

[0093] A database of expected base compositions for each primer region is generated using an in 
silico PCR search algorithm, such as (ePCR). An existing RNA structure search algorithm 
(Macke et al., Nuc. Acids Res., 2001, 29, 4724-4735, which is incorporated herein by reference 
in its entirety) has been modified to include PCR parameters such as hybridization conditions, 
mismatches, and thermodynamic calculations (SantaLucia, Proc. Natl. Acad. Sci. U.S.A., 1998, 
95, 1460-1465, which is incorporated herein by reference in its entirety). This also provides 
information on primer specificity of the selected primer pairs. 

[0094] Table 1 represents a collection of primers (sorted by forward primer name) designed to 
identify bacteria using the methods herein described. The forward or reverse primer name 
indicates the gene region of bacterial genome to which the primer hybridizes relative to a 
reference sequence eg: the forward primer name 16S_EC_1077_1 106 indicates that the primer 
hybridizes to residues 1077-1 106 of the gene encoding 16S ribosomal RNA in an E. coli 
reference sequence represented by a sequence extraction of coordinates 403 3 120. .4034661 from 
GenBank gi number 16127994 (as indicated in Table 2). As an additional example: the forward 
primer name BONTA_X52066_450_473 indicates that the primer hybridizes to residues 450- 
437 of the gene encoding Clostridium botulinum neurotoxin type A (BoNT/A) represented by 
GenBank Accession NTo. X52066 (primer pair name codes appearing in Table 1 are defined in 
Table 2). In Table 1, U a = 5-propynyluracil; C a = 5-propynylcytosine; * = phosphorothioate 
linkage. The primer pair number is an in-house database index number. 

Table 1: Primer Pairs for Identification of Bacterial Bio agents 

Primer For. I For. Rev. 
pair primer SEQ ID Rev. primer SEQ ID 

number name Forward sequence NO: name Reverse sequence NO: 

1 | 16S EC 107 1 GTGAGATGTTGGGTTAA | 1 | 16S EC 1175 | GACGTCATCCCCACCTTCC | 368 
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16S EC 774 



GTCCCGTAACGAG 
ATGTTGGGTTAAGTCCC 



ATGTTGGGTTAAGTCCC 



ATGTTGGGTTAAGTCCC 

GC 

ATGTTGGGTTAAGTCCC 



TAGTCCCGCAACGAGCG 



CAACGAGCGCAACCCT T 
CAAGTCATCATGGCCC T 



GCTACACACGTGCTACJA 



CTTGTACACACCGCCCG 



CACTGGAACTGAGACAC 



AGGGTTGTAAA 

TAACACATGCAAGTCG.A 



TAACACATGCAAGTCG.A 

ACG 

TAACACATGCAAGTCG.A 



CCAGCAGCCGCGGTAAT 



CGGAATTACTGGGCGT.A 



CGGAATTACTGGGCGT.A 



GTGTAGCGGTGAAATGC 



' -i GGTGAAATGC 



GTGTAGCGGTGAAATGC 

G 

GTGTAGCGGTGAAATGC 



AGAACACCGATGGCGA2V 



C-GGAGCAAACAGG&TT.A 



16S_EC_1177 
__1196_10G_1 
1G R 



16S EC 556 



16S EC 774 



TGACGTCATGGCCACCTTC 

C 

TGACGTCATGCCCACCTTC 



TGACGTCATCCCCACCTTC 



AAGGAGGTGATCCAGCC 



TGACGTCATCCCCACCTTC 



TGACGTCATCCCCACCTTC 



GACGTCATCCCCACCTTCC 



TCCCCACCTTCCTCC 



CGAGTTGCAGACTGCGATC 



GACGGGCGGTGTGTACAAG 



GACGGGCGGTGTGTACAAG 
ACCTTGTTACGACTTCACC 



CACGGCTACCTTGTTACGA 



CCTTGTTACGACTTCACCC 



TACGCATTACTCACCCGTC 



CTTTACGCCCAGTAATTCC 
TTACTCACCCGTCCGCCGC 



CGGCTGCTGGCACGAAGTT 



TTACTCACCCGTCCGCC 



ACGACACGAGCTGACGAC 



CGTACTCCCCAGGCG 

GTATCTAATCCTGTTTGCT 



CGCATTTCACCGCTACAC 
GTATCTAATCCTGTTTGCT 



CGAGTTGCAGACTGCGATC 
GTATCTAATCCTGTTTGCT 



CGTACTCCCCAGGCG 

GGTAAGGTTCTTCGCGTTG 
TGTTACTCACCCGTCTGCC 



CGTGGACTACCAGGGTATC 



CGTACTCCCCAGGCG 
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23S EC 182 



TGGATTAGAGACCCTGG 



GGATTAGATACCCTGGT 



TAGATACCCTGGTAGTC 



TAGATACCCTGGTAGTC 



AGAGTTTGATCATGGCT 

GAG 

AGAGTTTGATCATGGCT 



ACCACGCCGTAAACGAT 



AAGCGGTGGAGCATGTG 



ACGCGAAGAACCTTA 



ACGCGAAGAACCTTACC 

ACGCGAAGAACCTTACC 
GCGAAGAACCTTACCAG 



CGAAGAAU'U'TTACC 



CGAAGAACCTTACC 
TGCGCGGAAGATGTAAC 



TACCCCAAACCGACACA 

GG 

CCGTAACTTCGGGAGAA 



CTGACACCTGCCCGGTG 



TCTGACACCTGCCCGGT 



ACCTGCCCAGTGCTGGA 



GGTGGATGCCTTGGC 



GACAGTTCGGTCCCTAT 



TCTGTTCTTAGTACGAG 



897 TMOD R 



GGCCGTACTCCCCAGGCG 



TGGCCGTACTCCCCAGGCG 
GGCCGTACTCCCCAGGCG 



CGTACTCCCCAGGCG 



GCGACCGTACTCCCCAGG 



GCCTTGCGACCGTACTCCC 
AAGGAGGTGATCCAGCC 



ACTGCTGCCTCCCGTAG 
CCCCCGTCAATTCCTTTGA 



ATTGTAGCACGTC 



ACGAGCTGACGACAGCCAT 



TACGAGCTGACGACAGCCA 



ACGACACGAGU a C a G 



ACGACACGAGCTGACGAC 

GACGGGCGGTGTGTACAAG 
ACAACCATGCACCACCTGT 



ACACGAGU a C a GAC 



ACACGAGCTGAC 



TCGGACTCGCTTTCGCTAC 



23S BS 5 21 



CCTTCTCCCGAAGTTACG 



CACCGGGCAGGCGTC 



GACCGTTATAGTTACGGCC 



23S_EC_1906 



CCGACAAGGAATTTCGCTA 



TCGCTACCTTAGGACCGT 
TTCGCTCGCCGCTAC 



23S EC 2490 



23S EC 2658 



23S EC 2745 



GGGTTTCCCCATTCGG 
AGCCGACATCGAGGTGCCA 



AGTCCATCCCGGTCCTCTC 



TGCTTAGATGCTTTCAGC 
TTCGTGCTTAGATGCTTTC 



TTTCGTGCTTAGATGCTTT 
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CTAGTACGAGAGGACCG 



TAGTACGAGAGGACCGG 
GGGGAGTGAAAGAGATC 



ACTTAGATGCTTTCAGCGG 



TTAGATGCTTTCAGCACTT 
ACAAAAGGCACGCCATCAC 



ACAAAAGGTACGCCGTCAC 



CGAGAGGGAAACAACCC 



TGGCTGCTTCTAAGCCAAC 



OIF007_12 
1296 R 



AB_MLST-11- 

OI?0C7_1299 
1316 R 



TGCACCTGCGGTCGAGGG 



AB_MLST-11- 
OIF007_1335 
1362 R 



TAGGTTTACGTCAGTAT 



AB_MLST-11- 

OIF0C7JL47I 
L494 R 



TTATGGATGGCAACGTG 



AB_KLST-11- 
OIF007_1470 
1494 R 



TCGCTTGAGTGTAGTCATG 



AB_MLST-11- 

OIF007_1731 
1757 R 



AB_MLST-11- 
OIFO07_1790 
1821 R 



A3_MLST-11- 
OIF007_1875 
1909 R 



ABJMLST-11- 
OIF007JL895 
1927 R 



OIF007 1B5 



TATTGTTTCAAATGTAC 



AB_MLST-11- 
OIF007_291_ 
324 R 



TCACAGGTTCTACTTCATC 



CTTTGTCTGAAGATGG 



3_MLST-11- 
OIF007_2O97 
2118 R 



T GAC G GC AT C GAT AC C AC C 



OIF0O7 206 
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AB_MLST-11- 
OIF007_364_ 
393 R 



AB_MLST-11- 
OIF007_587_ 
610 R 



T TCTGCTTGAGGAATAGTG 



OIF007 601 



AB_MLST-11- 
OIF007_710_ 
736 R 



1151 
1100 
1101 



T" TGTACATTTGAAACAATA 



GCACAACCTGCGGCTGC 



A.CGGCACGAGGTAGTCGC 



TATGGCTCTACTCAA 



BONTA_X520 
66 538 552 



T7TACTTCTAACCCACTC 



:AF1_AF053 
)47_33407_ 
33430 F 



T7GCGGGCTGGTTCAACAAG 



CAF1_AF0539 
47_33499_33 
517 R 



T7GATGCGGGCTGGTTCAAC 



CAF1_AF053 
'_33515_ 
33541 F 



CAF1_AF053 
947_33687_ 
33716 F 



CAF1_AF0539 
47_33755_33 
7B2 R 



T7CAAGGTTCTCACCGTTTA 



CAPC BA 10 



TTGAATCTTGAAACACCATA 



CAPC_BA 11 



ACTCGTTTTTAATCAGC 



T7GAATCTTGAAACACCATA 



GGTAACCCTTGTCTTTGAA 



T7 GGTAACCCTTGTCTTTG 



CJST CJ 10 



CJST CJ 116 



T? GGTAACCCTTGTCTTTG 
T7CCCCTCATGTTTAAATGA 
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1063 
1050 
1058 



CTTTTTGATTCTTT 



TGGCTTATCCAAATTTA 



TCAGGATAAAAAGC 

TCGGTTTAAGCTCTACJVTG 



TCGATCCGCATCACCA.TCA 



1059 
1046 



65 2194 F 



1062 
1065 



TGAAGCTTGTTCTTTAG 



TACAACTGGTTCAAAA-ACA 



TCCTGTTATCCCTGAAG 



1052 
1047 



AGAGTATTAGAAGCTAG 



CJST CJ 663 



AAAAATGTCCAAGAAG 



1096 
1097 



CTXA_VBC_19 



GAAAGAGTTCGGATTGG 



TGTTGACCATGCTTCr TAG 



CTTCTACATTTTTAGC CAT 



TGTTAACGGCTTCAA&_ACC 



TCGAAGTACAATACAAG 



TTGTTAACGGCTTCAJ^GAC 



CGGCTTCAAGACCCC 



flCCACTTTTAATAAG&TTT 



GGTTTAGTACCAGAACA 



CCACTTTTAATAAGGX" TTG 



CGCGGTCGGCTCGTT&-ATG 



1104 
1103 
1092 



TCCGTTCTTACAAATAG 



GCTCTTAAAGATG 
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1091 
1095 



1128 
1130 
1129 



GYRB AB008 



INFE_EC_13 
65_1393_TM 
OP F 



TGGAAGATCTGGGTCAG 



TAGTTGCTCAAACAGCT 



CGTCAGGGTAAATTCCG 



GROL EC 57 7 



G YRA_AF 1005 



TAGCCGCGGTCGAATTGCA 



CAATCTGCTGACGGATCTG 



GYRB_AB0087 



TCACGCGCATCATCACCAG 



GYRB_AB 0 037 



ICD_CXB_172 



TAGCCCAGCTGTTTGAGCA 



TCCCTAATAGTAGAAATAA 



TAGGATTTTTCCACGGCGG 



CATGATGGTCACAACCGG 



TCGGCATCACGCCGTCGTC 



AACTTCGCCTTCGGTCATG 



TGGTAACAGAGCCTTAT 



1619 1643 R 



TCACGCGACGAGTGCCATC 
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TGAGGACCGTGTCGCGC 



TTTTCCAGCCATGCAGCGA 



TGTCACTCCCGACACGCCA 



IS1111A_NC0 
02971_6928_ 
6954 R 



IS1111A_NC 
002971_745 
6 7483 F 



TTTCATCGTTC 



IS1111A_NC0 
02971_7529_ 
7554 R 



TCAAGAAGAAAAAGAGC 



GAATATCAATTTGTAGC 



TTTACAGCTTTATGCAC 



T C T T G AC AGC AT C C GTT G 



CAGATAAAGAATCGCTCCA 



CAACGGATGCTGGCAAG 



LL_NC00314 
3_2366996_ 
57019 F 



LL_NC003143 
>367073_23 
67097 R 



j_NC00314 
_2367172_ 
2367194 F 



j_NC003143 
>367249_23 
67271 R 



TGATCCTGAATGTTTATAT 



TCAGGTACTGCTATCCA 



MECA Y14051 



MECA_Y1405 
1_4520_453 
OP F 



TU a U"AU a U a U a C a n a AA 



C a AU"C a U a AC a GU a U a A 



MECA_Y1405 
_4520 453 
OP F 



TU a O ! 'AU a U a U i 'C a U a AA 



AAAAATATTAACA 



CTTTTTGCCA 



GGTTACACTTTCC 



27 424_453_ 



TGCCTCGAAGCTGAATA 



TCAACGGTAACTTCTAT 



TTTAAGCGCCAGAAAGCAC 
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CAGAATCAAGTTCCCAG 



AATCTGCTATTTGGTCA 



TCCTGAAAAATGGAGCA 
CGG 

TGGAGCACGGCTTCTGA 



TCAGCGCGTACAGTGGG 



TGGTGACTCGGCATGTT 



TACAGGAGCAGCAGGCTTC 



TGGTTGTAGTTCCTGTAGT 



CCT GTAGT AGAAGAGGTAA 



TGATTATCAGCGGAAGTAG 



CCGTGCTCCATTTTTCAG 
TCGGATAAGCTGCCACAAG 



TCGGATAAGCTGCCACAAG 



PARC_X95819 



PLA_AF05394 



TTATACCGGAAACTTCC 



PLA_AF0539 
45_7382_74 
04 F 



TGGTCTGAGTACCTCCTTT 



PLA_AF0539 
45_7481_75 
03 F 



469 169 19 



RECA AF251 



TGGCACGGCCATCTCCG 



TCGTTTCACCCTGTCATGC 



TGCGGGTAGGGAGCTTG 



TCCGATAAGCCGGATTCTG 



RNASEP_BKM_ 



TCTCTTACCCCACCCTTTC 



TGCCTCGTGCAACCCACCC 



RNASEP_BRM 



TAAACCCCATCGGGA3C 



GAGGAAAGTCCATGCTC 

GC 

GAGGAAAGTCCATGCTC 



GAGGAAAGTCCATGCTC 



TGCCTCGCGCAACCTACCC 



GTAAGCCATGTTTTGTTCC 



GTAAGCCATGTTTTGTTCC 
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1076 
1075 



GAGGAAAGTCCATGCTC 



TAAGGATAGTGCAACAG 



ATAAGCCATGTTCTGTTCC 



RNASEP CLB_ 



GTAAGCCATGTTTTGTTCC 



GAGGAAAGTCCGGGCTC 



GAGGAA7AGTCCGGGCTC 



RNASEP_EC_3 



RNASEP_EC_ 



GAGGAAAGTCCGGGCTC 



ATAAGCCGGGTTCTGTCG 



ATAAGCCATGTTCTGTTCC 



GAGGAAAGTCCGGGCTC 



1085 
1082 
1083 
1086 



RNASEP_RKP 



RNASEP SA_ 



GAGGAAAGTCCATGCTC 



GTAAGCCATGTK 



RNASEP SA 



ATAAGCCGGGTTCTGTCG 



RNASEP_SA 



ATAAGCCATGTTCTGTTCC 



RNASEP SA 



GAGGAAAGTCCATGCTC 



ATAAGCCATGTTCTGTTCC 



TCCGCGGAGTTGACTGG 



TCTGTAAT GAACC 



CATCCACACGGTG 



RPLB EC 736 



GTGCTGGTTTACCCCATGG 



GGTTTACCCC 



TGTGCTGGTTTACCCCATG 



GACCACCTCGGCAACCG 



TTCGCTCTCGGCCTGGCC 



RPOB_EC_l 
4 5_18 66_1 
OP F 



RPOB_EC_37 



TTATCGCTCAGGCGAAC 



CTTGGAGGTAAGTCTCA 



TCGTCGCGGACTTCGAAGC 



TGCTGGATTCGCCTTTGCT 



TTTCTTGAAGAGTATGAGC 



RPOB EC_38 6 
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CAGCGTTTCGGCGAAAT 



RPOC_EC_109 



T G GT AAAC C AC 



GGCGCTTGTACTTACCGCA 



ACCCAGTGCTGCTGAAC 



GTTCAAATGCCTGGATACC 



CGCCGACTTCGACGGTG 



GAGCATCAGCGTGCGTGCT 



TGAGCATCAGCGTGCGTGC 



ACGCGGGCATGCAGAGATG 



TCAGGAGTCGTTCAACT 



TTACGCCATCAGGCCACGC 



ACGCCATCAGGCCACGCAT 



TCAGGAGTCGTTCAACT 



TACGCCATCAGGCCACGCA 



TTGGCCATCAGACCACGCA 



TGATTCTGGTGCCCGTG 



TTGGCCATCAGGCCACGCA 



RPOC EC 231 



RPOC_EC_22 
18_2241_TM 
OP F 



TAACAACCG 



TATTGGACAACGGTCGT 



TC^GGACAACGGTCGTC 



SP101_SPBT 
__1154_11 
7S F 



SP101 SPET 



SP101 SPET 



GTTTCCGTCA 



TCAATACCGCAACAGCG 



CGCAAAAAAATCCfeGCT 



SP101_SPE1 

L_92_116J1 
OD R 



SP101_SPET1 
1_1251_1277 
TMOD R 



TGTGGCCGATTTCACCACC 



AAACTATTTTTTTAGCTAT 
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SP101 SPET 
11_1314_13 

6 TMOD F 
SP101_SPET 

._1408_14 
37 F 



SPlOlJSPET 



CAGAAACTGGCT 



._1403_1431 

R 

SP101_SPET1 
1_1403_1431 
TMOD R 



SP101_SPET1 
1_1486_1515 
TMOD R 



ACTCGAACAC 



SP101_SPET 
L_1688_17 
5 TMOD F 



. 1783 1808 



SP101_SPET 



. 1808 1835 



GCGTGACGACCTTCTTGAA 



SP101_SPET 
11_1711_17 

3 TMOD F 



SP101_SPET 
11_1807_18 
35 F 



SP101_SPET 
11_1807_18 
35 TMOD F 



SP101_SPET1 
1_1901_1927 
TMOD R 



ATTGCCCAGAAATCAAATC 



P101_SPET 
1_1967_19 
1 TMOD F 



SP101_SPET 
11_216_2 4 3 

TMOD F 
SP101_SPET 
11_2260_22 

83 F 

SP101_SPET 
11_2260_22 
83 TMOD F 



SP101_SPET 
L_2375_23 
99 TMOD F 



01_SPET 
11_2468_24 
7 TMOD F 
?101_SPET 
11_266_295 



?101_SPET 
L_2961_29 
8 4 TMOD F 



GGCCACATGATT 



ATGGCCATGGCAGAAGC 



SP101_SPET1 
1_2375_2397 



01_SPET1 
1_2543_2570 
TMOD R 



SP101_SPET1 

l_355_380_T 
MOD R 



TCTGGGTGACCTGGTGTTT 



TTCTGGGTGACCTGGTGTT 



TGATGACTTTTTA5CTA 



SP101_SPET1 
1_3023_3045 



SP101 SPET1 



GGAATTTACCAGCGATAGA 



AATCGACGACCATCTTGGA 



TAATCGACGACCATCTTGG 
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SP101_SPET 
L_3085_31 
04 F 



SP101_SPET 
11_322_344 
TMOD F 



SP101_SPET 
L_3386_34 
03 TMOD F 



SP101_SPET 
11_3511_35 
35 TMOD F 



SP101_SPET 
11_600_629 
TMOD F 



SP101_SPET 
11_658_684 
TMOD F 



ATGGTOAGGCAGC 



TAGCTAJVTGGTCAGGCA 



AGCGTAAAGGTGAACCT 



TAGCGT.&AAGGTGAACC 



TTCGCCAATCAAAACTA 



AAAGATTTCTC 



ATCCCCTGCTTCTGCTGCC 



SP101_SPET1 



SP101_SPET1 



SP101_SPET1 
1_3605_3629 
TMOD R 



TATCCCCTGCTTCTGCTGC 



TGGGTCTACACCTGCACTT 



CCAACCTTTTCCACAACAG 



SP101_SPET1 
1_448_473_T 
MOD R 



TACCTTTTCCACAACAGAA 



SP101_S?ET1 
1_756_784_T 
MOD R 



TGCGATTGCGCG 



SP101_SPET 
U_893_921 
TMOD F 



,01_SPET1 
1_988_1012_ 
TMOD R 



TGCACGTCTGTTTCAGTTG 



222 TMOD R 



AACGCA.CAATCAGAAGC 



TCTGTTTCAGTTGCAAATT 



SSPE_BA_202 



TGCTTCTGGTGCTAGCA 



TGCTTCTGGC"GU'*C°AG 



SS?E_BA_243 
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TGGTGCTAGCATT 



TGGC a GU a C'AGU°AT? 



TGGTACAGAGTTTGCGA 



TACAGAGTTTGCGAC 



TCAAATGTACAAGGTGA 



SSPE_BA_163 



TRPE_AY0943 



TGCAGCTGATTGT 



TGU a AGU a TGAC a C a GT 



TTTGAATGCT 

TCATTTGTGCTTTGAATGC 



TGTGCC a C a C a GAAC a GU i 'T 



TCCTCTTTTCACAGGCTCT 



TRPE AY094 



TRPE AY0 94 



GTGCATGCGGATACAGA 



TGCAAGCGCGACCACA.T 



TGCCCAGGTACAACCTGCA 



TUFB EC 22 



GCCGTCCATTTGAGCAGCA 



GCCGTCCATCTGAGCAGCA 



TUFB_EC_337 



TATGTGCTCACGAGTTTGC 



TUFB EC 27 



TUFB EC 75 



AAGACGACCTGCACGGG 



GCGCTCCACGTCTTCACGC 



AACTACCGTCCTCAGT T 



CGTGGCGGCGTGGTTAT 



ACGAACTGGATGTCGCCGT 



TCGGTACGAACTGGATGTC 



WAAA_Z9692 



CGACGCGCTGCGCTTCA 



ACCGAGCAAGGAGACCA 



I' VJ L'GGT] [CM 3. 



VALS EC 123 



WAAA Z96925 



T5GCACC-AGCCTGACCTGT 



WO 2005/098047 



-40- 



PCT/US2005/005356 



| 5_28 6_311_ | TGTTTCAGT j | _394_412_R 



[O095] Primer pair name codes and reference sequences are shown in Table 2. The primer name 
code typically represents the gene to which the given primer pair is targeted. The primer pair 
name includes coordinates with respect to a reference sequence defined by an extraction of a 
section of sequence or defined by a GenBank gi number, or the corresponding complementary 
sequence of the extraction, or the entire GenBank gi number as indicated by the label "no 
extraction." Where "no extraction" is indicated for a reference sequence, the coordinates of a 
primer pair named to the reference sequence are with respect to the GenBank gi listing. Gene 
abbreviations are shown in bold type in the "Gene Name" column. 



Table 2: Primer Name Codes and Reference Sequences 



Primer 


Gene Name 


Organism 


Reference 
gi number 


Extracted gene 
coordinates of gi 


Extraction 
SEQ ID NO: 


16S EC 


16S rRNA (16S 
ribosomal RNA 


Escherichia 


16127994 


4033120. .4034661 


719 




23S rRNA (23S 
ribosomal RNA 
gene) 


coli 


16127 994 


4166230. .4169123 


720 


CAPC BA 


cape (capsule 
biosynthesis gene) 


Bacillus 


6470151 


Complement 
(55628. .56074) 


721 


CYA BA 


gLe) ' y 


Bacillus 


48942X6 


Complement 
(154288. .156626) 


722 




dnaK (chaperone 
dnaK gene) 


coli 


16127994 


12163. .14079 


723 




groL (chaperonin 


Escherichia 


- 6127 994 


4368603. .4370249 


724 


HFLB EC 


hflb (cell 
division protein 
peptidase ftsH) 


Escherichia 


16127994 


Complement 
(3322645. .3324576) 


725 


INFB EC 


infB (protein 
factor infB gene) 


Escherichia 
coli 


16127994 


Complement 
(3310983. .3313655) 




LEF BA 


lef (lethal 


Bacillus 


21392688 


Complement 
(149357. .151786) 


727 


PAG BA 


pag (protective 
antigen) 


Bacillus 


21392 688 


143779. .146073 


728 


RPLB EC 


ribosomal protein 
L2) 


Escherichia 
coli 


16127994 


3449001. .3448180 


729 


RPOB EC 


rpoB (DNA-directed 
RNA polymerase 
beta chain) 


Escherichia 


6127994 


Complement 
4178823. .4182851 


730 


RPOC EC 


rpoC (DNA-directed 
RNA polymerase 


Escherichia 
coli 


16127 994 


4182928. .4187151 


731 


SP101ET 
SPET 1 

1 


Concatenation 
comprising: 

gki (glucose 
kinase) 

gtr (glutamine 

transporter 

protein) 

murl (glutamate 
mutS (DNA mismatch 


Artificial 

partial gene 
sequences of 

pyogenes 


15674250 


Complement 
(1258294. .1258791) 

complement 
(1236751. .1237200) 

312732. .313169 

Complement 


732 
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(1787602. .1788007) 






xpt (xanthine 
phosphoribosyl 
•transferase) 






930977. .931425 






yqiL (acetyl-CoA- 

acetyl 

transferase) 






129471. .129903 






tkt 

( trans ketolase) 






1391844. .1391386 




SSPE BA 


sspE (small acid- 
soluble spore 


Bacillus 


30253828 


22 6496. .2267B3 


733 


TDFB EC 


tufB (Elongation 


Escherichia 
coli 


16127994 


41-73523. .4174707 


734 


VALS EC 


-valS (Valyl-tRNA 
synthetase) 


Escherichia 
coli 


16127994 


Complement 
(4481405. .4478550) 


735 


ASPS EC 


asps (Aspartyl- 
tRNA synthetase) 






complement (194 6777. . 
1948546) 


736 


CAF1 AF 
053947 


cafl (capsular 
protein cafl) 


Yersinia 
pestis 


2996286 


No extraction 
GenBank coordinates 
used 




INV U22 
457 






1256565 


74 . .3772 


737 


LL_NCOO 
3143 


y. pestis specific 
chromosomal genes 
- difference 


Yersinia 


16120353 


No extraction 
GenBank coordinates 
used 




BONTA X 
52066 


BoNT/A (neurotoxin 
type A) 


Clostridium 
botulinum 


40381 


77 . .3967 


738 


MECA Yl 
4051~ 


mecA methicillin 
resistance gene 


Staphylococcus 


2791983 


No extraction 
GenBank coordinates 


739 


TRPE AY 
094355 


trpE (anthranilate 
synthase (large 
component) ) 


Acinetobacter 
baumanii 


20853695 


No extraction 
GenBank coordinates 


740 


RECA AF 
251469 


recA (recombinase 

A) 


Acinetobacter 
baumanii 


9965210 


No extraction 
GenBank coordinates 
used 


741 


GYRA_AF 


gyrA (DNA gyrase 
subunit A) 


Acinetobacter 
baumanii 


4240540 


No extraction 
GenBank coordinates 


742 


GYRB AB 
008700 


gyrB (DNA gyrase 
subunit B) 


Acinetobacter 


4514436 


No extraction 
GenBank coordinates 


743 


WAAA Z9 
6925 


waaA (3-deoxy-D- 
manno-octulosonic- 
acid transferase) 


Acinetobacter 
baumanii 


2765828 


No extraction 
GenBank coordinates 


744 




Concatenation 
comprising: 










CJST_CJ 


tkt 

(transketolase) 

glyA (serine 
hydroxymethyltrans 
f erase) 

gltA (citrate 

aspA (aspartate 
ammonia lyase) 

glnA (glutamine 
synthase) 

(phosphoglycerate 


Artificial 
Sequence* - 
partial gene 
sequences of 
Campylobacter 
jejuni 


15791399 


15 69415. .1569873 
367573. .368079 

complement 
(1604529. .1604930) 

96 692. .97168 

complement 
(657609. .658085) 

327773. .328270 


745 
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uncA (ATP 
synthetase alpha 






112163. .112651 




RNASEP_ 


RNase P 

(ribonuclease P) 


Bordetella 
pertussis 


33591275 


Complement 
(3226720. .3227933) 




RNASEP_ 
BKM 


RNase P 

(ribonuclease P) 


Burkholderia 
mallei 


53723370 


(2527296. .2528220) 


7 7 


RNASEP 

BS 


RNase P 

(ribonuclease P) 


Bacillus 
subtilis 


16077068 


Complement 
(2330250. .2330962) 


748 


CLB ~~ 


RNase P 

(ribonuclease P) 


Clostridium 
perfringens 


18308982 


Complement 
(2291757. .2292584) 


749 


EC 


RNase P 

(ribonuclease P) 


Escherichia 
coli 


16127994 


Complement 
(3267457. .3268233 


750 


RNASEP_ 
RKP 


RNase P 


Rickettsia 
prowazekii 


15603881 


complement (605276. . 6 
06109) 


751 


RNASEP 
SA 


RNase P 

(ribonuclease P) 


Staphylococcus 


15922990 


complement (1559869. . 
1560651) 


752 


RNASEP 
VBC 


RNase P 
(ribonuclease P) 


cholerae 


15640032 


complement (2580367. . 
2581452) 






icd (isocitrate 
dehydrogenase) 


Coxiella 
burnetii 


29732244 


complement(1143867. . 
1144235) 


754 


IS1111A 


multi-locizs 
IS1111A insertion 
element 


Acinetobacter 
baumannii 


29732244 


No extraction 




OMPA AY 
485227 


membrane protein 
A) 


Rickettsia 
prowazekii 


40287451 


No extraction 


755 


OMPB RK 
P 


membrane protein 
B) 


SEES; 


15603881 


complement (881264 . . 8 
86195) 


756 


GLTA RK 
P 


gltA (citrate 


-VThTTo 


15603881 


complement (1062547. . 
1063857) 


757 


TOXR_VB 

C 


_gnthase) 

(transcription 
regulator tOXR) 


FranciseJIa 
tularensis 


15640032 


complement (1047143. . 
1048024) 


758 


ASD FRT 


semialdehyde 
dehydrogenase) 


Francisella 
tularensis 


56707187 


complement(438608. .4 
39702) 


759 


GALE_FR 
T 


galE (DDE>-glucose 
4-epimerase) 


Shigella 
flexneri 


56707187 


809039. .810058 


760 


IPAH SG 
F 


ipaH < invasion 
plasmid antigen) 


Campylobacter 
jejuni 


30061571 


2210775. .2211614 


751 


HUPB CJ 


hupB (DNA-binding 
protein Hu.-beta) 


Coxiella 
burnetii 


15791399 


complement (84 9317. .8 
49819) 


7 62 




Concatenation 

trpB (antnranilate 
synthase component 


Artificial 
Sequence* - 
partial gene 
sequences of 
Acinetofcacter 
baumannii 






763 


AB_MLST 


adk (adenylate 

mutY (adenine 
glycosylas e) 

fume (fumarate 
hydratase) 

efp (elong-ation 
factor p) 

ppa (pyrophosphate 

phospho- 

hydratase 






Sequenced in-house 
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[0096] * Note: These artificial reference sequences represent concatenations of partial gene 
extractions from the indicated reference gi number. Partial sequences were used to create the 
concatenated sequence because complete gene sequences were not necessary for primer design. 
The stretches of arbitrary residues "N"s were added for the convenience of separation of the 
partial gene extractions (100N for SP101_SPET11 (SEQ ID NO: 732); 50N for CJST_CJ (SEQ 
ID NO: 745); and 40N for ABMLST (SEQ ID NO: 763)). 

[0097] Example 2: DNA isolation and Amplification 

[0098] Genomic materials from culture samples or swabs were prepared using the DNeasy® 96 
Tissue Kit (Qiagen, Valencia, CA). All PCR reactions are assembled in 50 ul reactions in the 96 
well microtiter plate format using a Packard MPII liquid handling robotic platform and MJ 
Dyad® thermocyclers (MJ research, Walth.am, MA). The PCR reaction consisted of 4 units of 
Amplitaq Gold®, lx buffer II (Applied Biosystems, Foster City, CA), 1.5 mM MgCl 2 , 0.4 M 
betaine, 800 uM dNTP mix, and 250 nM of each primer. 

[0099] The following PCR conditions were used to amplify the sequences used for mass 
spectrometry analysis: 95C for 10 minutes followed by 8 cycles of 95C for 30 seconds, 48C for 
30 seconds, and 72C for 30 seconds, with the 48C annealing temperature increased 0.9C after 
each cycle. The PCR was then continued for 37 additional cycles of 95C for 15 seconds, 56C for 
20 seconds, and 72C for 20 seconds. 

[0100] Example 3: Solution Capture Purification of PCR Products for Mass Spectrometry 
with Ion Exchange Resin-Magnetic Beads 

[0101] For solution capture of nucleic acids with ion exchange resin linked to magnetic beads, 
25 ul of a 2.5 mg/mL suspension of BioClon amine terminated supraparamagnetic beads were 
added to 25 to 50 ul of a PCR reaction containing approximately 10 pM of a typical PCR 
amplification product. The above suspension was mixed for approximately 5 minutes by 
vortexing or pipetting, after which the liquid was removed after using a magnetic separator. The 
beads containing bound PCR amplification product were then washed 3x with 50mM ammonium 
bicarbonate/50% MeOH or lOOmM ammonium bicarbonate/50% MeOH, followed by three 
more washes with 50% MeOH. The bound PCR amplicon was eluted with 25mM piperidine, 
25mM imidazole, 35% MeOH, plus peptide calibration standards. 
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[0102] Example 4: Mass Spectrometry and Base Composition Analysis 
[0103] The ESI-FTICR mass spectrometer is based on a Bruker Daltonics (Billerica, MA) Apex 
II 70e electrospray ionization Fourier transform ion cyclotron resonance mass spectrometer that 
employs an actively shielded 7 Tesla superconducting magnet. The active shielding constrains 
the majority of the fringing magnetic field from the superconducting magnet to a relatively small 
volume. Thus, components that might be adversely affected by stray magnetic fields, such as 
CRT monitors, robotic components, and other electronics, can operate in close proximity to the 
FTICR spectrometer. All aspects of pulse sequence control and data acquisition were performed 
on a 600 MHz Pentium II data station running; Bruker' s Xmass software under Windows NT 4.0 
operating system. Sample aliquots, typically 1 5 pi, were extracted directly from 96-well 
microtiter plates using a CTC HTS PAL autos ampler (LEAP Technologies, Carrboro, NC) 
triggered by the FTICR data station. Samples were injected directly into a 10 ul sample loop 
integrated with a fiuidics handling system that supplies the 100 ul /hr flow rate to the ESI source. 
Ions were formed via electrospray ionization in a modified Analytica (Branford, CT) source 
employing an off axis, grounded electrospray probe positioned approximately 1.5 cm from the 
metalized terminus of a glass desolvation capillary. The atmospheric pressure end of the glass 
capillary was biased at 6000 V relative to the ESI needle during data acquisition. A counter- 
current flow of dry N 2 was employed to assist in the desolvation process. Ions were accumulated, 
in an external ion reservoir comprised of an rf-only hexapole, a skimmer cone, and an auxiliary 
gate electrode, prior to injection into the trapped ion cell where they were mass analyzed. 
Ionization duty cycles > 99% were achieved by simultaneously accumulating ions in the external 
ion reservoir during ion detection. Each detection event consisted of 1M data points digitized 
over 2.3 s. To improve the signal-to-noise ratio (S/N), 32 scans were co-added for a total data 
acquisition time of 74 s. 

[0104] The ESI-TOF mass spectrometer is based on a Bruker Daltonics MicroTOF™. Ions from 
the ESI source undergo orthogonal ion extraction and are focused in a refiectron prior to 
detection. The TOF and FTICR are equipped with the same automated sample handling and 
fiuidics described above. Ions are formed in the standard MicroTOF™ ESI source that is 
equipped with the same off-axis sprayer and glass capillary as the FTICR ESI source. 
Consequently, source conditions were the same as those described above. External ion 
accumulation was also employed to improve ionization duty cycle during data acquisition. Each 
detection event on the TOF was comprised of 75,000 data points digitized over 75 ps. 
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[0105] The sample delivery scheme allows sample aliquots to be rapidly injected into the 
electrospray source at high flow rate and subsequently be electrosprayed at a much lower flow 
rate for improved ESI sensitivity. Prior to injecting a sample, a "bolus of buffer was injected at a 
high flow rate to rinse the transfer line and spray needle to avoid sample 

contamination/carryover. Following the rinse step, the autosampler injected the next sample and 
the flow rate was switched to low flow. Following a brief equilibration delay, data acquisition 
commenced. As spectra were co-added, the autosampler continued rinsing the syringe and 
picking up buffer to rinse the injector and sample transfer line. In general, two syringe rinses 
and one injector rinse were required to minimize sample carryover. During a routine screening 
protocol a new sample mixture was injected every 106 seconds. More recently a fast wash 
station for the syringe needle has been implemented which, when combined with shorter 
acquisition times, facilitates the acquisition of mass spectra at a rate of just under one 
spectrum/minute. 

[0106] Raw mass spectra were post-calibrated with an internal mass standard and deconvoluted 
to monoisotopic molecular masses. Unambiguous base compositions were derived from the 
exact mass measurements of the complementary single-stranded oligonucleotides. Quantitative 
results are obtained by comparing the peak heights with an internal PCR calibration standard 
present in every PCR well at 500 molecules per well for the ribosomal DNA-targeted primers 
and 100 molecules per well for the protein-encoding gene targets. Calibration methods are 
commonly owned and disclosed in U.S. Provisional Patent Application Serial No. 60/545,425. 

[0107] Example 5: De Novo Determination of Base Composition of Amplification Products 
using Molecular Mass Modified Deoxynucleotide Triphosphates 

[0108] Because the molecular masses of the four natural nucleobases have a relatively narrow 
molecular mass range (A = 313.058, G = 329.052, C = 289.04S, T = 304.046 - See Table 3), a 
persistent source of ambiguity in assignment of base composition can occur as follows: two 
nucleic acid strands having different base composition may have a difference of about 1 Da 
when the base composition difference between the two strands is G <-> A (-15.994) combined 
with C <-> T (+15.000). For example, one 99-mer nucleic acid strand having a base composition 
of A27G30C21T21 has a theoretical molecular mass of 30779.058 while another 99-mer nucleic 
acid strand having a base composition of A 2 6G3iC22T 2 ohas a tlxeoretical molecular mass of 
30780.052. A 1 Da difference in molecular mass may be within the experimental error of a 
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molecular mass measurement and thus, the relatively narrow molecuLar mass range of the four 
natural nucleobases imposes an uncertainty factor. 

[0109] The present invention provides for a means for removing this theoretical 1 Da uncertainty 
factor through amplification of a nucleic acid with one mass-tagged nucleobase and three natural 
nucleobases. The term "nucleobase" as used herein is synonymous with other terms in use in the 
art including "nucleotide," "deoxynucleotide," "nucleotide residue," ""deoxynucleotide residue," 
"nucleotide triphosphate (NTP)," or deoxynucleotide triphosphate (dTsfTP). 



[0110] Addition of significant mass to one of the 4 nucleobases (dNTPs) in an amplification 
reaction, or in the primers themselves, will result in a significant difference in mass of the 
resulting amplification product (significantly greater than 1 Da) arising from ambiguities arising 
from the G <-» A combined with C T event (Table 3). Thus, the same the G <->■ A (-15.994) 
event combined with 5-Iodo-C *-»• T (-1 10.900) event would result in a molecular mass 
difference of 126.894. If the molecular mass of the base composition A27G30 5-Iodo-C 2 iT 2 i 
(33422.958) is compared with A 26 G3i5-Iodo-C22T2o, (33549.852) the theoretical molecular mass 
difference is +126.894. The experimental error of a molecular mass measurement is not 
significant with regard to this molecular mass difference. Furthermore, the only base 
composition consistent with a measured molecular mass of the 99-mer nucleic acid is A 2 yG3o5- 
Iodo-C2iT 2 i. In contrast, the analogous amplification without the mass tag has 18 possible base 
compositions. 

Table 3: Molecular Masses of Natural Nucleobases and the Mass-Modified Nucleobase 5- 
Iodo-C and Molecular Mass Differences Resulting from Transitions 



Nucleobase 



Molecular Mass 



R-->5-Iodo-C 



T — >5-Iodo-C 



A MolecuHar Mass 
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5-Iodo-C— >A 



5-Iodo-C-— >T 



[0111] Example 6: Data Processing 

[0112] Mass spectra of bioagent identifying amplicons are analyzed independently using a 
maximum-likelihood processor, such as is widely used in radar signal processing. This 
processor, referred to as GenX, first makes maximum likelihood estimates of the input to the 
mass spectrometer for each primer by running matched filters for each base composition 
aggregate on the input data. This includes the GenX response to a calibrant for eacfci primer. 



[0113] The algorithm emphasizes performance predictions culminating in probability-of- 
detection versus probability-of-false-alarm plots for conditions involving complex "backgrounds 
of naturally occurring organisms and environmental contaminants. Matched filters consist of a 
priori expectations of signal values given the set of primers used for each of the bioagents. A 
genomic sequence database is used to define the mass base count matched filters. The database 
contains the sequences of known bacterial bioagents and includes threat organisms as well as 
benign background organisms. The latter is used to estimate and subtract the spectral signature 
produced by the background organisms. A maximum likelihood detection of known background 
organisms is implemented using matched filters and a running-sum estimate of the noise 
covariance. Background signal strengths are estimated and used along with the matched filters to 
form signatures which are then subtracted, the maximum likelihood process is applied to this 
"cleaned up" data in a similar manner employing matched filters for the organisms and a 
running-sum estimate of the noise-covariance for the cleaned up data. 



[0114] The amplitudes of all base compositions of bioagent identifying amplicons for each 
primer are calibrated and a final maximum likelihood amplitude estimate per organism is made 
based upon the multiple single primer estimates. Models of all system noise are factored into this 
two-stage maximum likelihood calculation. The processor reports the number of molecules of 
each base composition contained in the spectra. The quantity of amplification product 
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corresponding to the appropriate primer set is reported as well as the quantities of primers 
remaining upon completion of the amplification reaction. 

[0115] Example 7: Use of Broad Range Survey and Division Wide Primer Pairs for 
Identification of Bacteria in an Epidemic Surveillance Investigation 
[0116] This investigation employed a set of 16 primer pairs which is herein designated the 
"surveillance primer set" and comprises broad range survey primer pairs, division wide primer 
pairs and a single Bacillus clade primer pair. The surveillance primer set is shown in Table 4 
and consists of primer pairs originally listed in Table 1 . This surveillance set comprises primers 
with T modifications (note TMOD designation in primer names) which constitutes a functional 
improvement with regard to prevention of non-templated adenylation (vide supra) relative to 
originally selected primers which are displayed below in the same row. Primer pair 449 (non-T 
modified) has been modified twice. Its predecessors are primer pairs 70 and 357, displayed 
below in the same row. Primer pair 360 has also been modified twice and its predecessors are 
primer pairs 17 and 118. 



Table 4: Bacterial Primer Pairs of the Surveillance Primer Set 



Primer 

Pair 

No. 


Forward Primer Name 


Forward 

(SEQ ID 
NO:) 


Reverse Primer Name 


Reverse 

(SEQ ID 
NO:) 


Target Gene 


346 


16S EC 713 732 TMOD F 


27 


16S EC 789 809 TMOD R 


389 


16S rRNA 




16S_EC_713_732_F 


26 


16S_EC_789_809 


388 


16S rRNA 


347 


16S EC 785 806 TMOD F 


30 


16S EC 880 897 TMOD R 


392 


16S rRNA 


11 


16S EC 785 806 F 


29 


16S EC 880 897 R 


391 


16S rRNA 


348 


16S EC 960 981 TMOD F 


3 8 


1 6 S_EC_1 0 5 4_1 0 7 3_TMOD_R 


3 63 


16S rRNA 


14 


16S EC 960 981 F 


37 


16S EC 1054 1073 R 


362 


16S rRNA 


349 


23S EC 1826 1843 TMOD F 


49 


23S_EC_1906_1924_TMOD_R 


405 


23 S rRNA 


16 


23S EC 1826 1843 F 


48 


23S EC 1905 1924 R 


404 


23 S rRNA 


352 


INFB EC 1365 1393 TMOD F 


161 


INFB EC 1439 1467 TMOD R 


515 


infB 


34 


INFB EC 1365 1393 F 


160 


;NFB EC 1439 1467 R 


515 


infB 


354 


RPOC EC 2218 2241 TMOD F 


262 


RPOC_EC_2 3 13_2 3 3 7_TMOD_R 


625 


rpoC 


52 


RPOC EC 2218 2241 F 


261 


RPOC EC 2313 2337 R 


624 




355 


SSPE BA 115 137 TMOD F 


321 


SSPE BA 197 222 TMOD R 


687 




58 


SSPE BA 115 137 F 


322 


SSPE BA 197 222 R 






356 


RPLB EC 650 679 TMOD F 


232 


RPLB EC 739 7 62 TMOD R 


592 


rplB 


66 


RPLB EC 650 67 9 F 


231 


RPLB EC 739 762 R 


591 


rplB 


358 


VALS EC 1105 1124 TMOD F 


350 


VALS_EC_1 1 95_1218_TMOD_R 


712 


valS 


71 


VALS EC 1105 1124 F 


349 


VALS EC 1195 1218 R 


711 


valS 


359 


RPOB EC 1845 1865 TMOD F 


241 


RPOB EC 1909 1929 TMOD R 


597 




72 


RPOB EC 1845 1866 F 


240 


RPOB EC 1909 1929 R 


596 




3 60 


23S EC 2646 2667 TMOD F 


60 


23S EC 2745 2765 TMOD R 


416 


23S rRNA 




23S__EC_264 6_2667_F 


59 


2 3SEC 27 4 52 7 6 5_R 


415 


23S rRNA 




23S EC 2645 2669 F 


58 


23S EC 2744 2761 R 




23S rRNA 
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361 


1 6S_EC_1 0 90_11 ll_2_TMOD_F 


5 


16S EC 1175 1196 TMOD R 


370 


16S rRNA 


3 


16S EC 1090 1111 2 F 


6 


16S EC 1175 1196 R 


369 


16S rRNA 


362 


RPOB EC 3799 3321 TMOD F 


245 


RPOB EC 3862 3888 TMOD R 


603 


rpoB 


289 


RPOB EC 3799 3821 F 


246 


RPOB EC 3862 3888 R 


602 


rpoB 




RPOC EC 2146 2174 TMOD F 




RPOC EC 2227 2245 TMOD R 






290 


RPOC EC 2146 2174 F 


256 


RPOC EC 2227 2245 R 






367 


TUFB EC 957 979 TMOD F 


345 


TUFB EC 1034 1058 TMOD R 


701 


tufB 


293 


TUFB EC 957 979 F 


344 


TUF3 EC 1034 1058 R 


700 


tufB 


449 


RPLB_EC_690_710_F 


237 


RPLB_EC_737_7 5 8_R 


589 


rplB 


357 


RPLB EC 688 710 TMOD F 




RPLB EC 736 757 TMOD R 


588 


rplB 


67 


RPLB EC 688 710 F 


235 


RPLB EC 736 757 R 


587 


rplB 



[0117] The 16 primer pairs of the surveillance set are used to produce bioagent identifying 
amplicons whose base compositions are sufficiently different amongst all known bacteria at the 
species level to identify, at a reasonable confidence level, any given bacterium at the species 
level. As shown in Tables 6A-E, common respiratory bacterial pathogens can be distinguished 
by the base compositions of bioagent identifying amplicons obtained using the 16 primer pairs of 
the surveillance set. In some cases, triangulation identification improves the confidence level for 
species assignment. For example, nucleic acid from Streptococcus pyogenes can be amplified by 
nine of the sixteen surveillance primer pairs and Streptococcus pneumoniae can be amplified by 
ten of the sixteen surveillance primer pairs. The base compositions of the bioagent identifying 
amplicons are identical for only one of the analogous bioagent identifying amplicons and differ 
in all of the remaining analogous bioagent identifying amplicons by up to four bases per bioagent 
identifying amplicon. The resolving power of the surveillance set was confirmed by 
determination of base compositions for 120 isolates of respiratory pathogens representing 70 
different bacterial species and the results indicated that natural variations (usually only one or 
two base substitutions per bioagent identifying amplicon) amongst multiple isolates of the same 
species did not prevent correct identification of major pathogenic organisms at the species level. 

[0118] Bacillus anthracis is a well known biological warfare agent which has emerged in 
domestic terrorism in recent years. Since it was envisioned to produce bioagent identifying 
amplicons for identification of Bacillus anthracis, additional drill-down analysis primers were 
designed to target genes present on virulence plasmids of Bacillus anthracis so that additional 
confidence could be reached in positive identification of this pathogenic organism. Three drill- 
down analysis primers were designed and are listed in Tables 1 and 5. In Table 5 the drill-down 
set comprises primers with T modifications (note TMOD designation in primer names) which 
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constitutes a functional improvement with regard to prevention of non-templated adenylation 
(vide supra) relative to originally selected primers which are displayed below in the same row. 
Table 5: Drill-Down Primer Pairs for Confirmation of Identification of Bacillus anthracis 



Primer 

Pair 

No. 


Forward Primer Name 


Forward 
Primer 
(SEQID 
NO:) 


Reverse Primer Name 


(SEQID 
NO:) 


Target Gene 




CAPC BA 274 303 TMOD F 


98 


CAPC BA 349 376 TMOD R 


452 




24 


CAPC BA 274 303 F 


97 


CAPC BA 349 376 R 


451 




351 


CYA BA 1353 1379 TMOD F 


128 


CYA BA 1448 1467 TMOD R 


483 




30 


CYA BA 1353 1379 F 


127 


CYA BA 1448 1467 R 


482 




353 


LEF BA 756 781 TMOD F 


175 


LEF BA 843 872 TMOD R 


531 


lef 


37 


LEF BA 756 781 F 


174 


LEF BA 843 872 R 


530 


lef 



[0119] Phylogenetic coverage of bacterial space of the sixteen surveillance primers of Table 4 
and the three Bacillus anthracis drill-down primers of Table 5 is shown in Figure 3 which lists 
common pathogenic bacteria. Figure 3 is not meant to be comprehensive in illustrating all 
species identified by the primers. Only pathogenic bacteria are listed as representative examples 
of the bacterial species that can be identified by the primers and methods of the present 
invention. Nucleic acid of groups of bacteria enclosed within the polygons of Figure 3 can be 
amplified to obtain bioagent identifying amplicons using the primer pair numbers listed in the 
upper right hand corner of each polygon. Primer coverage for polygons within polygons is 
additive. As an illustrative example, bioagent identifying amplicons can be obtained for 
Chlamydia trachomatis by amplification with, for example, primer pairs 346-349, 360 and 361, 
but not with any of the remaining primers of the surveillance primer set. On the other hand, 
bioagent identifying amplicons can be obtained from nucleic acid originating from Bacillus 
anthracis (located within 5 successive polygons) using, for example, any of the following primer 
pairs: 346-349, 360, 361 (base polygon), 356, 449 (second polygon), 352 (third polygon), 355 
(fourth polygon), 350, 351 and 353 (fifth polygon). Multiple coverage of a given organism with 
multiple primers provides for increased confidence level in identification of the organism as a 
result of enabling broad triangulation identification. 

[0120] In Tables 6A-E, base compositions of respiratory pathogens for primer target regions are 
shown. Two entries in a cell, represent variation in ribosomal DNA operons. The most 
predominant base composition is shown first and the minor (frequently a single operon) is 
indicated by an asterisk (*). Entries with NO DATA mean that the primer would not be expected 
to prime this species due to mismatches between the primer and target region, as determined by 
theoretical PGR. 
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Table 6A - Base Compositions of Common Respiratory Pathogens for Bioagent Identifying 



Amplicons Corresponding to Primer Pair Nos: 346, 347 and 348 



Organism 


Strain 


Primer 346 
[A G C T] 


Primer 347 
[A 6 C T] 


Primer 348 
[A G C T] 


Klebsiella 
pneumoniae 


MGH78578 


[29 32 25 13] 
[29 31 25 13]* 


[23 38 28 26] 
[23 37 28 26]* 


[26 32 28 30] 
[26 31 28 30]* 


Yersinia pestis 


CO-92 Biovar 
Orientalis 


[29 32 25 13] 


[22 39 28 26] 


[29 30 28 29] 
[30 30 27 29]* 


Yersinia pestis 


KIM5 P12 (Biovar 
Mediaevalis ) 


[29 32 25 13] 


[22 39 28 26] 


[29 30 28 29] 


Yersinia pestis 


91001 


[29 32 25 13] 


[22 39 28 26] 


[29 30 28 29] 
[30 30 27 29]* 


Haemophilus 
influenzae 


KW20 


[28 31 23 17] 


[24 37 25 27] 


[29 30 28 29] 


Pseudomonas 
aeruginosa 


PAOl 


[30 31 23 15] 


[26 36 29 24] 
[27 36 29 23]* 


[26 32 29 29] 


Pseudomonas 


Pf0-1 


[30 31 23 15] 


[26 35 29 25] 


[28 31 28 29] 


Pseudomonas 
putida 


KT2440 


[30 31 23 15] 


[28 33 27 27] 


[27 32 29 28] 


Legionella 
pneumophila 


Philadelphia-1 


[30 30 24 15] 


[33 33 23 27] 


[29 28 28 31] 


Francisella 
tularensis 


^chu_J 


[32 29 22 16] 


[28 38 26 26] 


[25 32 28 31] 


Bordetella 
pertussis 


Tohama I 


[30 29 24 16] 


[23 37 30 24] 


[30 32 30 26] 


Burkholderia 
cepacia 


J2315 


[29 29 27 14] 


[27 32 26 29] 


[27 36 31 24] 
[20 42 35 19]* 


Burkholderia 
pseudomallei 


K96243 


[29 29 27 14] 


[27 32 26 29] 


[27 36 31 24] 


Neisseria 
aonorrhoeae 


FA 1090, ATCC 
700825 


[29 28 24 18] 


[27 34 26 28] 


[24 36 29 27] 


Neisseria 
meningitidis 


MC58 (serogroup B) 


[29 28 26 16] 


[27 34 27 27] 


[25 35 30 26] 


Neisseria 
meningitidis 


serogroup C, FAM1B 


[29 28 26 16] 


[27 34 27 27] 


[25 35 30 26] 


Neisseria 
meningitidis 


Z2491 (serogroup A) 


[29 28 26 16] 


[27 34 27 27] 


[25 35 30 26] 


Chlamydophila 
pneumoniae 


TW-183 


[31 27 22 19] 


NO DATA 


[32 27 27 29] 


Chlamydophila 
pneumoniae 


AR39 


[31 27 22 19] 


NO DATA 


[32 27 27 29] 


Chlamydophila 
pneumoniae 


CWL029 


[31 27 22 19] 


NO DATA 


[32 27 27 29] 


Chlamydophila 
pneumoniae 


J138 


[31 27 22 19] 


NO DATA 


[32 27 27 29] 


Corynebacterium 
diphtheriae 


NCTC13129 


[29 34 21 15] 


[22 38 31 25] 


[22 33 25 34] 


Mycobacterium 


klO 


[27 36 21 15] 


[22 37 30 28] 


[21 36 27 30] 


Mycobacteri urn 


104 


[27 36 21 15] 


[22 37 30 28] 


[21 36 27 30] 


Mycobacteri um 
tuberculosis 


CSU#93 


[27 36 21 15] 


[22 37 30 28] 


[21 36 27 30] 


Mycobacterium 


CDC 1551 


[27 36 21 15] 


[22 37 30 28] 


[21 36 27 30] 


Mycobacterium 


H37Rv (lab strain) 


[27 36 21 15] 


[22 37 30 28] 


[21 36 27 30] 


Mycoplasma 
pneumoniae 


M129 


[31 29 19 20] 


NO DATA 


NO DATA 


Staphylococcus 


MRSA252 


[27 30 21 21] 


[25 35 30 26] 


[30 29 30 29] 
[29 31 30 29]* 


Staphylococcus 


MSSA476 


[27 30 21 21] 


[25 35 30 26] 


[30 29 30 29] 
[30 29 29 30]* 


Staphylococcus 


COL 


[27 30 21 21] 


[25 35 30 26] 


[30 29 30 29] 
[30 29 29 30]* 


Staphylococcus 


Mu50 


[27 30 21 21] 


[25 35 30 26] 


[30 29 30 29] 
[30 29 29 30]* 


Staphylococcus 


MW2 


[27 30 21 21] 


[25 35 30 26] 


[30 29 30 29] 
[30 29 29 30]* 
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Staphylococcus 






[25 35 30 26] 


[30 29 30 29] 
[30 29 29 30]* 


Staphylococcus 


NCTC 8325 




[25 35 30 26] 


[30 29 30 29] 


Streptococcus 
agalactiae 






[24 36 31 25] 


[25 32 29 30] 


Streptococcus 


NC 002955 








Streptococcus 
pyogenes 






[24 37 30 25] 




Streptococcus 
pyogenes 


MGAS315 








Streptococcus 
pyogenes 








[25 31 29 31] 


Streptococcus 
pyogenes 


MGAS10394 








Streptococcus 
pyogenes 








[25 31 29 31] 


Streptococcus 
pyogenes 


SF370 (Ml) 








Streptococcus 
pneumoniae 










Streptococcus 




[26 32 23 18] 




[25 32 29 30] 


Streptococcus 
pneumoniae 


TIGR4 


[26 32 23 18] 


[25 35 28 28] 


[25 32 30 29] 


Streptococcus 
gordonii 


NCTC7868 


[25 33 23 18] 


[24 36 31 25] 


[25 31 29 31] 


Streptococcus 
mitls 


NCTC 12261 


[26 32 23 18] 


[25 35 30 26] 


[25 32 29 30] 
[24 31 35 29]* 


Streptococcus 
mutans 


UA15 9 


[24 32 24 19] 


[25 37 30 24] 


[28 31 26 31] 



Table 6B - Base Compositions of Common Respiratory Pathogens for Bioagent Identifying 
Amplicons Corresponding to Primer Pair Nos: 349, 360, and 356 



Organism 


Strain 


[A G C T] 


Primer 360 
[A G C T] 


Primer 356 1 
[A G C T] 


Klebsiella 
pneumoniae 


MGH7B578 


[25 31 25 22] 


[33 37 25 27] 


NO DATA 


yersinia pestis 


CO-92 Biovar 
Orientalis 


[25 31 27 20] 
[25 32 26 2D]* 


[34 35 25 28J 


NO DATA 


Yersinia pestis 


KIM5 P12 (Biovar 
Mediaevalis) 


[25 31 27 20] 
[25 32 26 20]* 


[34 35 25 28] 


NO DATA 


Yersinia pestis 


91001 


[25 31 27 20] 


[34 35 25 28] 




Haemophilus 
influenzae 


KW20 


[28 28 25 20] 


[32 38 25 27] 




Pseudomonas 


PAOl 


[24 31 26 20] 


[31 36 27 27] 
[31 36 27 28]* 


NO DATA 


Pseudomonas 
fluorescens 


Pf0-1 


NO DATA 


[30 37 27 28] 




Pseudomonas 
putida 


KT2440 


[24 31 26 20] 


[30 37 27 28] 




Legionella 
pneumophila 


Philadelphia-1 


[23 30 25 23] 


[30 39 29 24] 




Francisella 
tularensis 


schu 4 


[26 31 25 19] 


[32 36 27 27] 


NO DATA 


Bordetella 
pertussis 




[21 29 24 18] 


[33 36 26 27] 


NO DATA 


Burkholderia 
cepa cia 


J2315 


[23 27 22 20] 


[31 37 28 26] 




pseudomallei 


K96243 


[23 27 22 20] 


[31 37 28 26] 


NO DATA 


gonorrhoeae 


FA 1090, ATCC 700825 


[24 27 24 17] 


[34 37 25 26] 


NO DATA 


meningitidis 


MC58 (serogroup B) 


[25 27 22 18] 


[34 37 25 26] 


NO DATA 


Neisseria 
meningi fcidis 


serogroup C, FAM18 


[25 26 23 18] 


[34 37 25 26] 


NO DATA 


Neisseria 


Z2491 (serogroup A) 


[25 26 23 18; 


[34 37 25 26] 


NO DATA 
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meningitidis 










Chlamydophila 




[30 28 27 18] 






Chlamydophila 
pneumoniae 




[30 28 27 18] 






Chlamydophila 
pneumoniae 


CWL029 


[30 28 27 18] 


NO DATA 


NO DATA 


Ch 1 amydoph ila 
pneumoniae 




[30 28 27 18 ] 


NO DATA 




Corynebacterium 
diphtheriae 


NCTC13129 




[29 40 28 25] 




Mycobacterium 






[33 35 32 22] 




Mycobacterium 










Mycobacteri urn 
tuberculosis 


CSO#93 




[30 36 34 22] 




Mycobacterium 
tuberculosis 


CDC 1551 


NO DATA 


[30 36 34 22] 




Mycobacterium 
tuberculosis 


H37Rv (lab strain) 








Mycoplasma 
pneumoniae 




[28 30 24 19] 






Staphylococcus 


MB.SA252 


[26 30 25 20] 


[31 38 24 29] 


[33 30 31 27] 


Staphylococcus 


MSSA47 6 


[26 30 25 20] 


[31 38 24 29] 


[33 30 31 27] 


Staphylococcus 




[26 30 25 20] 


[31 38 24 29] 




Staphylococcus 




[26 30 25 20] 


[31 38 24 29] 




Staphylococcus 




[26 30 25 20] 


[31 38 24 29] 


[33 30 31 27] 


Staphylococcus 




[25 30 25 20] 


[31 38 24 29] 


[33 30 31 27] 


Staphylococcus 


NCTC 8325 


[26 30 25 20] 


[31 38 24 29] 




Streptococcus 
agalactiae 


NEM316 


[28 31 22 20] 


[33 37 24 28] 




Streptococcus 


NC 002955 


[28 31 23 19] 


[33 38 24 27] 


[37 31 28 25] 


Streptococcus 
pyogenes 


MGAS8232 


[28 31 23 19] 


[33 37 24 28] 


[38 31 29 23] 


Streptococcus 
pyogenes 


MGAS315 


[28 31 23 19] 


[33 37 24 28] 




Streptococcus 
pyogenes 




[28 31 23 19] 


[33 37 24 28] 




Streptococcus 


MGAS10394 


[28 31 23 19] 






Streptococcus 
pyogenes 


Manfredo (M5) 


[28 31 23 19] 






Streptococcus 
pyogenes 


SF370 (Ml) 


[28 31 23 19] 
[23 31 22 20] * 


[33 37 24 28] 




Streptococcus 
pneumoniae 




[23 31 22 20] 


[34 36 24 28] 


[37 30 29 25] 


Streptococcus 




[28 31 22 20] 


[34 36 24 28] 


[37 30 29 25] 


Streptococcus 
pneumoniae 


TIGR4 


[28 31 22 20] 


[34 36 24 28] 


[37 30 29 25] 


Streptococcus 
gordonii 


NCTC7868 


[28 32 23 20] 


[34 36 24 28] 


[36 31 29 25] 


Streptococcus 


NCTC 122 61 


[28 31 22 20] 
[23 30 22 20]* 


[34 36 24 28] 


[37 30 29 25] 


Streptococcus 


UA159 


[26 32 23 22] 


[34 37 24 27] 


NO DATA 



WO 2005/098047 



-54- 



PCT/US2005/005356 



Table 6C - Base Compositions of Common Respiratory Pathogens for Bioagent Identifying 



Amplicons Corresponding to Primer Pair Nos: 449, 354, and 352 



Organism 


Strain 


Primer 449 
[A 6 C T] 


Primer 354 
[A G C T] 


Primer 352 
[A G C T] 


Klebsiella 
pneumoniae 


MGH78578 


NO DATA 


[27 33 36 26] 


NO DATA 


Yersinia pestis 


CO-92 Biovar 
Orientalis 


NO DATA 


[29 31 33 29] 


[32 28 20 25] 


Yersinia pestis 


KIM5 P12 (Biovar 


NO DATA 


[29 31 33 29] 


[32 28 20 25] 


Yersinia pestis 


J1221 


_NO_DATA 


[29 31 33 29] 


NO DATA 


Haemophilus 
influenzae 


_KW20 




[30 29 31 32] 


NO DATA 




PAOl 


—————— 

NO DATA 


[26 33 39 24] 


NO DATA 


ITeldoZTas 

fluoresceins 


Pf0-1 


NO DATA 


[26 33 34 29] 


NO DATA 




KT2440 


M ° DATA 


[25 34 36 27] 


NO DATA 


Legionella 
pneumophila 


Philadelphia-1 


NO DATA 


NO DATA 


NO DATA 


Francisella 
llraZTellt 


schu 4 


NO DATA 


[33 32 25 32] 


NO DATA 


pertussis 


Tohama I 


NO DATA 


[26 33 39 24] 


NO DATA 


Burkholderia 


J2315 


NO DATA 


[25 37 33 27] 


NO DATA 


Burkholderia 
pseudomallei 


K96243 


NO DATA 


[25 37 34 26] 


NO DATA 


Neisseria 
gonorrhoeae 


7fl 1090, ATCC 700825 


[17 23 22 10] 


[29 31 32 30] 


NO DATA 


Neisseria 
meningitidis 


MC58 (serogroup B) 


NO DATA 


[29 30 32 31] 


NO DATA 


Neisseria 
meningitidis 


ssrogroup C, EAM18 


NO DATA 


[29 30 32 31] 


NO DATA 


Neisseria 
meningitidis 


Z2491 (serogroup A) 


NO DATA 


[29 30 32 31] 


NO DATA 


Chlamydophila . 
pneumoniae 


TW-183 


NO DATA 


NO DATA 


NO DATA 


Chlamydophila 
pneumoniae 


AR39 


NO DATA 


NO DATA 


NO DATA 


Chlamydophila 
pneumoniae 


CWL029 


NO DATA 


NO DATA 


_NO_DATA 


Chlamydophila 
pneumoniae 


J138 


NO DATA 


NO DATA 


NO DATA 


Coryneba cteri urn 
diphtheriae 


NCTC13129 


NO DATA 


NO DATA 


NO DATA 


Mycobacterium 




NO DATA 


NO DATA 


NO DATA 


Mycobacterium 


— 

104 


NO DATA 


NO DATA 


NO DATA 


Mycobacterium 


CSU#93 


NO DATA 


NO DATA 


NO DATA 


Mycobacterium 
tuberculosis 


CDC 1551 


NO DATA 


NO DATA 


NO DATA 


Mycobacterium 
tuberculosis 


H37Rv (lab strain) 


NO DATA 


NO DATA 


NO DATA 


Mycoplasma 


M12 9 


NO DATA 


NO DATA 


NO DATA 


Staphylococcus 


MRSA252 


[17 20 21 17] 


[30 27 30 35] 


[36 24 19 26] 


Staphylococcus 


MSSA476 


[17 20 21 17] 


[30 27 30 35] 


[36 24 19 26] 


Staphylococcus 


COL 


[17 20 21 17] 


[30 27 30 35] 


[35 24 19 27] 


Staphylococcus 


MU50 


[17 20 21 17] 


[30 27 30 35] 


[36 24 19 26] 


Staphylococcus 
aureus 


MW2 


[17 20 21 17] 


[30 27 30 35] 


[36 24 19 26] 
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Staphylococcus 








[36 24 19 26] 


Staphylococcus 


NCTC 8325 




[30 27 30 35] 


[35 24 19 27] 


Streptococcus 
agalactiae 


NEM316 






[29 26 22 28] 


Streptococcus 




[22 21 19 13] 


NO DATA 


NO DATA 


Streptococcus 
pyogenes 


MGAS8232 




[ 1 


NO DATA 


Streptococcus 
pyogenes 


MGAS315 




[24 32 30 36] 


NO DATA 


Streptococcus 
pyogenes 




[23 21 19 12] 


[ 1 


NO DATA 


Streptococcus 
pyogenes 


MGAS10394 


[23 21 19 12] 


[ 1 


NO DATA 


Streptococcus 
pyogenes 








NO DATA 


Streptococcus 
pyogenes 


SF370 (Ml) 






NO DATA 


pneumoniae 




[22 20 19 14] 


[25 33 29 35] 


[30 29 21 25] 


Streptococcus 










pneumoniae 


TIGR4 


[22 20 19 14] 


[25 33 29 35] 


[30 29 21 25] 


Streptococcus 
gordonii 




[21 21 19 14] 


NO DATA 


[29 26 22 28] 


Streptococcus 


NCTC 122 61 


[22 20 19 14] 


[26 30 32 34] 


NO DATA 


Streptococcus 


UA15 9 


NO DATA 


NO DATA 


NO DATA 



Table 6D - Base Compositions of Common Respiratory Pathogens for Bioagent Identifying 
Amplicons Corresponding to Primer Pair Nos: 355, 358, and 359 





Strain 


Primer 355 
[A G C T] 


Primer 358 
[A G C T] 


Primer 359 
[A G C T] 


Klebsiella 
pneumoniae 


MGH78578 


NO DATA 


[24 39 33 20] 


[25 21 24 17] 


Yersinia pestis 


CO-92 Biovar 
Orientalis 


NO DATA 


[26 34 35 21] 


[23 23 19 22] 


Yersinia pestis 


KIM5 P12 (Biovar 
Mediaevalis) 


NO DATA 


[26 34 35 21] 


[23 23 19 22] 


Yersinia pestis 


91001 


NO DATA 


[26 34 35 21] 


[23 23 19 22] 


Haemophilus 
influenzae 


KW20 


NO DATA 


NO DATA 


NO DATA 


Pseudomonas 


PAOl 


NO DATA 


NO DATA 


NO DATA 


Pseudomonas 


Pf0-1 


NO DATA 


NO DATA 


NO DATA 




KT2440 


NO DATA 


[21 37 37 21] 


NO DATA 


Legionella 






NO DATA 


NO DATA 


tularensis 






NO DATA 


NO DATA 


Bordetella 


Tohama I 


NO DATA 






Burkholderia 


J2315 


NO DATA 


NO DATA 




Burkholderia 
pseudomallei 


K96243 


NO DATA 


NO DATA 


NO DATA 


Neisseria 
gonorrhoeae 


FA 1090, ATCC 700825 


NO DATA 


NO DATA 


NO DATA 


Neisseria 
meningitidis 


MC58 (serogroup B) 


NO DATA 


NO DATA 


NO DATA 


Neisseria 
meningitidis 


serogroup C, FAM18 


NO DATA 


NO DATA 


NO DATA 
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Mycobacterium 



Mycobacteria 



Myc oba cterium 



H37Rv (lab £ 



Staphylococcus 



Staphylococcus 



Manfredo (M5) 



Streptococcus 



Streptococcus 



Streptococcus 
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Table 6E - Base Compositions of Common Respiratory Pathogens for Bioagent Identifying 



Annplicons Corresponding to Primer Pair Nos: 362, 363, and 367 



Organism 


Strain 


Primesr 362 
[A G C T] 


Primer 363 
[A G C T] 


Primer 367 
[A G C T] 


Klebsiella 
pneumoniae 


MGH78578 


[21 33 22 16] 


[16 34 26 26] 


NO DATA 


Yersinia pestis 


CO-92 Biovar 
Orientalis 


[20 34 18 20] 


NO DATA 


NO DATA 


Yersinia pestis 


KIM5 P12 (Biovar 
Mediaevalis) 


[20 34 18 20] 


NO DATA 


NO DATA 




_91001 


[20 34 IS 20] 


NO DATA 


NO DATA 


lZlophilu e s StlS 
influenzae 


_KW20 


NO DATA 


NO DATA 


NO DATA 


Pseudomonas 


PAOl 


[19 35 21 17] 


[16 35 28 22] 


m DATA 


PseudorconL 

fluorescens 


Pf0-1 


NO DATA 


[18 35 26 23] 


NO DATA 


Pseudomonas 


KT2440 


NO DATA 


[16 35 28 23] 


NO DATA 


TelTonella 

pneumophila 


Philadelphia-1 


NO DATA 


NO DATA 


N0 DRTA 


Francisella 


SChU 4 


NO DATA 


NO DATA 


NO DATA 


Bordetella 
pertussis 


T ° hama 1 


[20 31 24 17] 


[15 34 32 21] 


[26 25 34 19] 


Burkholderia 
Turkhtlderia 


J2315 


[20 33 21 18] 


[15 35 26 25] 


[25 27 32 20] 




K96243 


[19 34 19 20] 


[15 37 28 22] 


[25 27 32 20] 


Neisseria 
gonorrhoeae 


FA 1090, ATCC 700825 


NO DATA 


NO DATA 


NO DATA 


meningitidis 


MC58 (serogroup B) 


NO DATA 


NO DATA 


NO DATA 


Neisseria 
meningitidis 


serogroup C, FAM18 


NO DATA 


NO DATA 


NO DATA 


Neisseria 
meningitidis 


Z2491 (serogroup A) 


NO DATA 


NO DATA 


NO DATA 


Chlamydophila 
pneumoniae 


TW-183 


NO DATA 


NO DATA 


NO DATA 


Chlamydophila 
pneumoniae 


AR39 


NO DATA 


NO DATA 


NO DATA 


Chlamydophila 
pneumoniae 


CWL029 


NO DATA 


NO DATA 


NO DATA 


Chlamydophila 
pneumoniae 


J138 


NO DATA 


NO DATA 


NO DATA 


Corynebacterium 
diphtheriae 


NCTC13129 


NO DATA 


NO DATA 


NO DATA 


Mycobacterium 


klO 


[19 34 23 16] 


NO DATA 


[24 26 35 19] 


Mycobacterium 


_104 


[19 34 23 16] 


NO DATA 


[24 26 35 19] 


Mycobacterium 
tuberculosis 


CSD#93 


[19 31 25 17] 


NO DATA 


[25 25 34 20] 


Mycobacterium 
tuberculosis 


CDC 1551 


[19 31 24 18] 


NO DATA 


[25 25 34 20] 


Mycobacterium 
tuberculosis 


H37Rv (lab strain) 


[19 31 24 18] 


NO DATA 


[25 25 34 20] 


Mycoplasma 
pneumoniae 


M129 


NO DATA 


NO DATA 


NO DATA 


Staphylococcus 


MRSA252 


NO DATA 


NO DATA 


NO DATA 


Staphylococcus 


MSSA476 


NO DATA 


NO DATA 


NO DATA 


Staphylococcus 


COL 


NO DATA 


NO DATA 


NO DATA 


Staphylococcus 




NO DATA 


NO DATA 


NO DATA 


Staphylococcus 


MW2 


NO DATA 


NO DATA 


NO DATA 


Staphylococcus 


N315 


NO DATA 


NO DATA 


NO DATA 
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Staphylococcus 


NCTC 8325 


NO DATA 






Streptococcus 




NO DATA 


NO DATA 




Streptococcus 










Streptococcus 


MGAS8232 


NO DATA 


NO DATA 




Streptococcus 
pyogenes 


MGAS315 




NO DATA 


NO DATA 


Streptococcus 










Streptococcus 










Streptococcus 
pyogenes 


Manfredo (M5) 


NO DATA 


NO DATA 


NO DATA 


Streptococcus 
pyogenes 


SF370 (Ml) 


NO DATA 


NO DATA 


NO DATA 


Streptococcus 
pneumoniae 




NO DATA 


NO DATA 


NO DATA 


Streptococcus 




[20 30 19 23] 






Streptococcus 


TIGR4 


[20 30 19 23] 


NO DATA 


NO DATA 


Streptococcus 
gordonii 


NCTC7868 


NO DATA 


NO DATA 


NO DATA 


Streptococcus 
mitis 


NCTC 122 61 


NO DATA 


NO DATA 


NO DATA 


Streptococcus 


UA15 9 


NO DATA 


NO DATA 


NO DATA 



[0121] Four sets of throat samples from military recruits at different military facilities taken at 
different time points were analyzed using the primers of the present invention. The first set was 
collected at a military training center from November 1 to December 20, 2002 during one of the 
most severe outbreaks of pneumonia associated with group A Streptococcus in the United States 
since 1968. During this outbreak, fifty-one throat swabs were taken from both healthy and 
hospitalized recruits and plated on blood agar for selection of putative group A Streptococcus 
colonies. A second set of 1 5 original patient specimens was taken during the height of this group 
A Streptococcus -associated respiratory disease outbreak. The third set were historical samples, 
including twenty-seven isolates of group A Streptococcus, from disease outbreaks at this and 
other military training facilities during previous years. The fourth set of samples was collected 
from five geographically separated military facilities in the continental U.S. in the winter 
immediately following the severe November/December 2002 outbreak. 

[0122] Pure colonies isolated from group A Streptococcus-selective media from all four 
collection periods were analyzed with the surveillance primer set. All samples showed base 
compositions that precisely matched the four completely sequenced strains of Streptococcus 
pyogenes. Shown in Figure 4 is a 3D diagram of base composition (axes A, G and C) of 
bioagent identifying amplicons obtained with primer pair number 14 (a precursor of primer pair 
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number 348 which targets 16S rRNA). The diagram indicates that the experimentally determined 
base compositions of the clinical samples closely match the base compositions expected for 
Streptococcus pyogenes and are distinct from the expected base compositions of other 
organisms. 

[0123] In addition to the identification of Streptococcus pyogenes, other potentially pathogenic 
organisms were identified concurrently. Mass spectral analysis of a sample whose nucleic acid 
was amplified by primer pair number 349 (SEQ ID NOs: 49 and 405) exhibited signals of 
bioagent identifying amplicons with molecular masses that were found to correspond to 
analogous base compositions of bioagent identifying amplicons of Streptococcus pyogenes QA21 
G32 C24 T18), Neisseria meningitidis (A25 G27 C22 T18), and Haemophilus influenzae (A28 
G28 C25 T20) (see Figure 5 and Table 6B). These organisms were present in a ratio of 4:5:20 as 
determined by comparison of peak heights with peak height of an internal PCR calibration 
standard as described in commonly owned U. S. Patent Application Serial No: 60/545,425 winch 
is incorporated herein by reference in its entirety. 

[0124] Since certain division-wide primers th at target housekeeping genes are designed to 
provide coverage of specific divisions of bacteria to increase the confidence level for 
identification of bacterial species, they are not expected to yield bioagent identifying amplicons 
for organisms outside of the specific divisions. For example, primer pair number 356 (SEQ ID 
NOs: 232:592) primarily amplifies the nucleic acid of members of the classes Bacilli and 
Clostridia and is not expected to amplify proteobacteria such as Neisseria meningitidis and 
Haemophilus influenzae. As expected, analysis of the mass spectrum of amplification products 
obtained with primer pair number 356 does not indicate the presence of Neisseria meningitidis 
and Haemophilus influenzae but does indicate the presence of Streptococcus pyogenes (Figures 3 
and 6, Table 6B). Thus, these primers or types of primers can confirm the absence of particular 
bioagents from a sample. 

[0125] The 1 5 throat swabs from military recruits were found to contain a relatively small set of 
microbes in high abundance. The most comnxon were Haemophilus influenza, Neisseria 
meningitides, and Streptococcus pyogenes. Staphylococcus epidermidis, Moraxella cattarhctlis, 
Corynebacterium pseudodiphtheriticum, and Staphylococcus aureus were present in fewer 
samples. An equal number of samples from healthy volunteers from three different geographic 
locations, were identically analyzed. Results indicated that the healthy volunteers have bacterial 
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flora dominated by multiple, commensal non-beta-hemolytic Streptococcal species, including the 
viridans group streptococci (S. parasangunis, S. vestibularis,, S. mitis, S. oralis and S. 
pneumoniae; data not shown), and none of the organisms found in the military recruits were 
found in the healthy controls at concentrations detectable by mass spectrometry. Thus, the 
military recruits in the midst of a respiratory disease outbreak had a dramatically different 
microbial population than that experienced by the general population in the absence of epidemic 
disease. 

[0126] Example 8: Drill-down Analysis for Determination of emm-Type of Streptococcus 
pyogenes in Epidemic Surveillance 

[0127] As a continuation of the epidemic surveillance investigation of Example 7, determination 
of sub-species characteristics (genotyping) of Streptococcus pyogenes, was carried out based on 
a strategy that generates strain-specific signatures according to the rationale of Multi-Locus 
Sequence Typing (MLST). In classic MLST analysis, internal fragments of several housekeeping 
genes are amplified and sequenced (Enright et al. Infection and Immunity, 2001, 69, 2416-2427). 
In classic MLST analysis, internal fragments of several housekeeping genes are amplified and 
sequenced. In the present investigation, bioagcnt identifying amplicons from housekeeping genes 
were produced using drill-down primers and analyzed by mass spectrometry. Since mass spectral 
analysis results in molecular mass, from which base composition can be determined, the 
challenge was to determine whether resolution of emm classification of strains of Streptococcus 
pyogenes could be determined. 

[0128] An alignment was constructed of concatenated alleles of seven MLST housekeeping 
genes (glucose kinase (gki), glutamine transporter protein (gtr), glutamate racemase (murl), 
DNA mismatch repair protein (mutS), xanthine phosphoribosyl transferase (xpt), and acetyl-CoA 
acetyl transferase (yqiL)) from each of the 212 previously eww-typed strains of Streptococcus 
pyogenes. From this alignment, the number and location of primer pairs that would maximize 
strain identification via base composition was detexmined. As a result, 6 primer pairs were 
chosen as standard drill-down primers for determination of emm-type of Streptococcus pyogenes. 
These six primer pairs are displayed in Table 7. Ttiis drill-down set comprises primers with T 
modifications (note TMOD designation in primer names) which constitutes a functional 
improvement with regard to prevention of non-templated adenylation (vide supra) relative to 
originally selected primers which are displayed below in the same row. 
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Table 7: Group A Streptococcus Drill-Down Primer Pairs 



Primer 
Pair No. 


Forward Primer Name 


Forward Primer 
(SEQ ID NO:) 


Reverse Primer Name 


Reverse Primer 
CSEQ ID NO:) 


Target Gene 


442 


SP101_SPET11 358 387 
TMOD_F 


311 


SP101 SPET11 448 
473_TMOD_R 


S69 


gki 




SP101 SPET11 358 387 




SP101 SPET11 448 
473 TMOD R 


668 




443 


SP101_SPET11_600_629_ 
TMOD_F 


314 


SP101 SPETll 686 
714_TMOD_R 


S71 


gtr 




SP101 SPET11 600 629 




SP101 SPETll 68 6 
714 R 


670 




426 


SP101 SPET11 1314 133 
6_TMOD_F 


278 


SP101_SPET11 1403 
_1431_TMOD_R 


633 






SP101 SPETll 1314 133 
6 F 




SP101 SPETll 1403 
1431 R 






430 


SP101_SPET11 1807 183 
5_TMOD_F 


286 


SP101 SPETll 1901 
_1927_TMOD_R 


641 




90 


SP101_SPET11 1807_183 
5 F 


285 


SP101 SPETll 1901 
1927 R 


640 






SP101 SPET11 3075 310 
3_TMOD_F 


302 


3P101 SPETll 3168 
_3196_TMOD_R 


657 


xpt 


96 


SPiai_SPETU 3075 310 
3 F 


301 


3P101 SPETll 3168 
3196 R 


656 


xpt 


441 


SP101_SPET11_3511_353 


309 


SP101 SPETll 3605 
_3629_TMOD_R 


664 


yqiL 


98 


SP101 SPET11 3511 353 
5 F 


308 


SP101 SPETll 3605 
3629 R 


663 


yqiL 



[0129] The primers of Table 7 were used to produce bioagent identifying amplicons from nucleic 
acid present in the clinical samples. The bioagent identifying amplicons which were 
subsequently analyzed by mass spectrometry and base compositions corresponding to the 
molecular masses were calculated. 

[0130] Of the 51 samples taken during the peak of the November/December 2002 epidemic 
(Table 8A-C rows 1-3), all except three samples were found to represent emmS, a Group A 
Streptococcus genotype previously associated with high respiratory -virulence. The three outliers 
were from samples obtained from healthy individuals and probably represent non-epidemic 
strains. Archived samples (Tables 8A-C rows 5-13) from historical collections showed a greater 
heterogeneity of base compositions and emm types as would be expected from different 
epidemics occurring at different places and dates. The results of the mass spectrometry analysis 
and emm gene sequencing were found to be concordant for the epidemic and historical samples. 
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Table 8A: Base Composition Analysis of Bioagent Identifying Amplicons of Group A 



Streptococcus samples from Six Military Installations Obtained with Primer Pair Nos. 426 
and 430 



# of 

Instances 


emm-type by 

iTactromet 
pec rome ry 


emm-Gene 
Sequencing 


Location 
(sample) 


Year 


murl 

Ho rl 426) Pair 
o. 


mutS 

No^IsO) Pa±r 
o. 


48 




3 


MCRD San 
Diego 

(Cultured) 


2002 


A39 G25 C20 T34 


A38 G27 C23 T33 




— 

J 




A40 G24 C20 T34 


A38 G27 C23 T33 


— 

-i 


28 


— 

28 


A39 G25 C20 T34 


A38 G27 C23 T33 


15 


3 


ND 


A39 G25 C20 T34 


A38 G27 C23 T33 


6 


3 


3 


NHRC San 


2003 


A39 G25 C20 T34 


A38 G27 C23 T33 


3 


5,58 


5 


A40 G24 C20 T34 


A38 G27 C23 T33 


6 


JL 


6 


A40 G24 C20 T34 


A38 G27 C23 T33 








(Cultured) 


A39 G25 C20 T34 


A38 G27 C23 T33 


3 


12 


12 


A40 G24 C20 T34 


A38 G26 C24 T33 




22 


22 


A39 G25 C20 T34 


A38 G27 C23 T33 


3 


25,75 


75 


A39 G25 C20 T34 


A38 G27 C23 T33 


4 


44/61,82,9 


44/61 


A40 G24 C20 T34 


A38 G26 C24 T33 


2 


53,91 


91 


A39 G25 C20 T34 


A38 G27 C23 T33 


1 


2 


2 


Ft. 
Wood 

(Cultured) 


2003 


A39 G25 C20 T34 


A38 G27 C24 T32 


2 


3 


3 


A39 G25 C20 T34 


A38 G27 C23 T33 


1 


4 


4 


A39 G25 C20 T34 


A38 G27 C23 T33 


1 


6 


6 


A40 G24 C20 T34 


A38 G27 C23 T33 


11 


25 or 75 


75 


A39 G25 C20 T34 


A38 G27 C23 T33 


1 


25,75, 33, 
34,4,52,84 


75 


A39 G25 C20 T34 


A38 G27 C23 T33 


1 


44/61 or 82 
or 9 


4 4/61 


A40 G24 C20 T34 


A38 G26 C24 T33 


2 


5 or 58 


5 


A40 G24 C20 T34 


A38 G27 C23 T33 


3 




1 


(Cultured) 


2003 


A40 G24 C20 T34 


A38 G27 C23 T33 


2 


3 




A39 G25 C20 T34 


A38 G27 C23 T33 








A39 G25 C20 T34 


A38 G27 C23 T33 


1 


28 


28 


A39 G25 C20 T34 


A38 G27 C23 T33 


1 


3 


3 


Ft. 

Benning 
(Cultured) 


2003 


A39 G25 C20 T34 


A38 G27 C23 T33 


1 


4 


4 


A39 G25 C20 T34 


A38 G27 C23 T33 


3 


6 


6 


A40 G24 C20 T34 


A38 G27 C23 T33 


1 


11 


11 


A39 G25 C20 T34 


A38 G27 C23 T33 


1 


13 


94** 


A40 G24 C20 T34 


A38 G27 C23 T33 


— 


44/61 or 82 
° r 9 


_82 


A40 G24 C20 T34 


A38 G26 C24 T33 


J 






A40 G24 C20 T34 


A38 G27 C23 T33 




^ r 5 g 9 


~89 ' 




A38 G27 C23 T33 


2 


5 or 58 




Lackland 
AFB 

(Throat 


2003 


A40 G24 C20 T34 


A38 G27 C23 T33 




2 


A39 G25 C20 T34 


A38 G27 C24 T32 


1 


81 or 90 


A40 G24 C20 T34 


A38 G27 C23 T33 


1 


78 


A38 G26 C20 T34 


A38 G27 C23 T33 




No detection 


No detection 


No detection 


7 


3 


ND 


(Throat 


2002 


A39 G25 C20 T34 


A38 G27 C23 T33 




3 




No detection 


A38 G27 C23 T33 






ND 


No detection 


No detection 




3 


ND 


No detection 


No detection 


2 


3 


ND 


No detection 


A38 G27 C23 T33 


3 


No detection 


ND 


No detection 


No detection 
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Table 8B: Base Composition Analysis of Bioagent Identifying Amplicons of Group A 
Streptococcus samples from Six Military Installations Obtained with Primer Pair Nos. 438 
and 441 



# of 

Instances 


emm-type by 
Spectrometry 


emm-Gene 
Sequencing 


(sample) 


Tear 


acpt 

No. 438) 


yqiL 

No. 441) 


48 


3 


3 


MCRD San 


2002 


A30 G36 C20 T36 


A40 G2 9 C19 T31 


2 


6 .. . 


6 


(Cultured) 


A30 G36 C20 T36 


A40 G29 C19 T31 


1 


28 


28 


A30 G36 C20 T36 


A41 G28 C18 T32 


15 


3 


ND 


A30 G36 C20 T36 


A40 G29 C19 T31 


6 


3 


3 




2003 


A30 G36 C20 T36 


A40 G29 C19 T31 


3 


5,58 


5 


A30 G36 C20 T36 


A40 G29 C19 T31 


6 


6 


6 


A30 G36 C20 T36 


A40 G2 9 C19 T31 


1 




11 


NHRC San 
(Cultured) 


A30 G36 C20 T36 


A40 G29 C19 T31 




12 




A30 G36 C19 T37 


A40 G29 C19 T31 


1 


22 


22 


A30 G36 C20 T36 


A40 G29 C19 T31 


3 


25,75 


75 


A30 G36 C20 T36 


A40 G29 C19 T31 


4 


44/61,82,9 


44/61 


A30 G36 C20 T36 


A41 G28 C19 T31 


2 


53,91 


91 


A30 G36 C19 T37 


A40 G29 C19 T31 


1 


2 


2 


Ft. 

Leonard 


2003 


A30 G36 C20 T36 


A40 G29 C19 T31 


2 


3 


3 


A30 G36 C20 T36 


A40 G29 C19 T31 


1 


4 


4 


A30 G36 C19 T37 


A41 G28 C19 T31 


1 


6 


6 


A30 G36 C20 T36 


A40 G29 C19 T31 


11 


25 or 75 


75 


A30 G36 C20 T36 


A40 G29 C19 T31 


1 


25,75, 33, 
34,4,52,84 


75 


A30 G36 C19 T37 


A40 G29 C19 T31 


1 


0/9 


44/61 


A30 G36 C20 T36 


A41 G2 8 C19 T31 


2 


5 or 58 


5 


A30 G36 C20 T36 


A40 G29 C19 T31 


3 


1 


1 


(Cultured, 


2003 


A30 G36 C19 T37 


A40 G29 C19 T31 




3 


3 


A30 G36 C20 T36 


A40 G29 C19 T31 


1 


4 


4 


A30 G36 C19 T37 


A41 G28 C19 T31 


1 


28 


28 


A30 G36 C20 T36 


A41 G28 C18 T32 


1 


3 


3 


Ft. 

Benning 
(Cultured) 


2003 


A30 G36 C20 T36 


A40 G29 C19 T31 


1 


4 




A30 G36 C19 T37 


A41 G28 C19 T31 


3 


6 


6 


A30 G36 C20 T36 


A40 G29 C19 T31 


1 


11 


11 


A30 G36 C20 T36 


A40 G29 C19 T31 


1 


13 


94** 


A30 G36 C20 T36 


A41 G28 C19 T31 




44/61 or 82 
or 9 


82 


A30 G36 C20 T36 


A41 G28 C19 T31 


— 


5 or 58 


_58 


A30 G36 C20 T36 


A40 G29 C19 T31 


— 










2 


5 or 58 


ND 


Lackland 
AFB 

Swabs ) 


2003 


A30 G36 C20 T36 


A40 G29 C19 T31 


1 


2 


A30 G36 C20 T36 


A40 G29 C19 T31 




81 or 90 


A30 G36 C20 T36 


A40 G29 C19 T31 


1 


78 


A30 G36 C20 T36 


A41 G28 C19 T31 




No detection 


No detection 


No detection 


7 


3 


ND 


MCRD San 

(Throat 
Swabs) 


2002 


A30 G36 C20 T36 


A40 G29 C19 T31 




3 


ND 


A30 G36 C20 T36 


A40 G2 9 C19 T31 


1 


3 




A30 G36 C20 T36 


No detection 


1 


3 




No detection 


A40 G29 C19 T31 


2 


3 


ND 


A30 G36 C20 T36 


A40 G29 C19 T31 


3 


No detection 


ND 


No detection 


No detection 
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Table 8C: Base Composition Analysis of Bioagent Identifying Amplicons of Gronp A 
Streptococcus samples from Six Military Installations Obtained with Primer Pair Nos. 438 
and 441 



emm-type by 
Spectrometry 



30 G36 C17 T33 



A32 G35 C17 T32 



A32 G35 C17 T32 



A30 G36 C20 T30 



A31 G35 C17 T33 



A30 G36 C20 T30 



A31 G35 C17 



A31 G35 C17 



A30 G36 C17 T33 



A32 G35 C17 T32 



3 G36 C17 T33 



A32 G35 C17 T32 



A31 G35 C17 T33 



Leonard 
(Cultured) 



A31 G35 C17 T33 



A30 G36 C17 T33 



A3Q G36 C17 T33 



A30 G36 C18 T32 



A30 G36 C20 T30 



Ft. Sill 
(Cultured) 



A30 G36 C18 T32 



A31 G35 C17 T33 



) G36C17T33 



A32G35C17T32 



A31G35C17 



A31 G35 C17 T33 



Bennlng 
(Cultured) 



A30 G36 C20 T30 



A30 G36 C19 1 



A30 G36 C18 T32 



A30 G36 C20 1 



A30 G36 C17 ] 



A30 G35 C17T33 



A3D G35C18T32 



A32 G35 C17 T32 



A32 G35 C17 T32 
A32 G35 C17 T32 
A32 G35 C17 T32 
No detection 
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[0131] Example 9: Design of Calibrant Polynucleotides based on Bioagent Identifying 
Amplicons for Identification of Species of Bacteria (Bacterial Bioagent Identifying 
Amplicons) 

[0132] This example describes the design of 19 calibrant polynucleotides based on bacterial 
bioagent identifying amplicons corresponding to the primers of the broad surveillance set (Table 
4) and the Bacillus anthracis drill-down set (Table 5). 

[0133] Calibration sequences were designed to simulate bacterial bioagent identifying amplicons 
produced by the T modified primer pairs shown in Table 4 (primer names have the designation 
"TMOD"). The calibration sequences were chosen as a representative member of the section of 
bacterial genome from specific bacterial species which would be amplified by a given primer 
pair. The model bacterial species upon which the calibration sequences are based are also shown 
in Table 9. For example, the calibration sequence chosen to correspond to an amplicon produced 
by primer pair no. 361 is SEQ ID NO: 722. In Table 9, the forward (_F) or reverse (_R) primer 
name indicates the coordinates of an extraction representing a gene of a standard reference 
bacterial genome to which the primer hybridizes e.g.: the forward primer name 
16S_EC_713_732_TMOD_F indicates that the forward primer hybridizes to residues 713-732 of 
the gene encoding 16S ribosomal RNA in an E. coli reference sequence (in this case, the 
reference sequence is an extraction consisting of residues 4033120-4034661 of the genomic 
sequence of E. coli K12 (GenBank gi number 16127994). Additional gene coordinate reference 
information is shown in Table 10. The designation "TMOD" in the primer names indicates that 
the 5' end of the primer has been modified with a non-matched template T residue which 
prevents the PCR polymerase from adding non-templated adenosine residues to the 5' end of the 
amplification product, an occurrence which may result in miscalculation of base composition 
from molecular mass data (vide supra). 

[0134] The 19 calibration sequences described in Tables 9 and 10 were combined into a single 
calibration polynucleotide sequence (SEQ ID NO: 741 - which is herein designated a 
"combination calibration polynucleotide") which was then cloned into a pCR®-Blunt vector 
(Invitrogen, Carlsbad, CA). This combination calibration polynucleotide can be used in 
conjunction with the primers of Table 9 as an internal standard to produce calibration amplicons 
for use in determination of the quantity of any bacterial bioagent. Thus, for example, when the 
combination calibration polynucleotide vector is present in an amplification reaction mixture, a 
calibration amplicon based on primer pair 346 (16S rRNA) will be produced in an amplification 
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reaction with primer pair 346 and a calibration amplicon based on primer pair 363 (rpoC) will be 
produced with primer pair 363. Coordinates of each of the 19 calibration sequences within the 
calibration polynucleotide (SEQ ID NO: 783) are indicated in Table 10. 

Table 9: Bacterial Primer Pairs for Production of Bacterial Bioagent Identifying 
Amplicons and Corresponding Representative Calibration Sequences 



Calibral 
Sequent 
(SEQ II 



16S_EC_1175_1196_T 



23S_EC_:90 6_1924_TMOD_E 



!_EC_2646_2667_TMO 



23S_EC_2745_2765. 



34 9_37 6_IMOD_R 



ft_1353_1379_TMO 



8_14 67_TMOD_I 



3_EC_1365_1393_1 



39_1467_TMCD_ 



LEF_BA_756_7 81_1 



LE F_BA_8 4 3_8 7 2_TMOD_R 



RPLB_E( _" )_7c _I 1 _l 



RPLB_EC_7 37_758_R 



RPOB_EC_l 8 4 S_l 8 6 6_TM 



RPOC EC 2218 2241 



TUFB EC 957 97 9 TMOD 



TUFB_EC_1034_105B_1 



Table 10: Primer Pair Gene Coordinate References and Calibration Polynucleotide 
Sequence Coordinates within the Combination Calibration Polynucleotide 



4033120. .4034661 
4033120. .4034661 
4166220. .4169123 
4166220. .4169123 



1591. .1672 



32928. .4187151 



1810. .1926 



3449001. .3448180 
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"SIpC I 56074.. 55628 {complement I 6470151 (P) p50 | 2517.. 2616 

~5yl 156626.. 154288 4894216 (P) 351 1338. .1449 

J3. antJiracis (complement strand) 

lef 127442. .129921 4894216 (P) 353 1121. .1234 

B. anthracis 

5S P E 226496. .226783 30253828 (G) 355 1007-1104 



[0135] Example 10: Use of a Calibration Polynucleotide for Determining the Quantity of 
Bacillus Anthracis in a Sample Containing a Mixture of Microbes 
[0136] The process described in this example is shown in Figure 7. The capC gene is a gene 
involved in capsule synthesis which resides on the pX02 plasmid of Bacillus anthracis. Primer 
pair number 350 (see Tables 9 and 10) was designed to identify Bacillus anthracis via 
production of a bacterial bioagent identifying amplicon. Known quantities of the combination 
calibration polynucleotide vector described in Example 3 were added to amplification mixtures 
containing bacterial bioagent nucleic acid from a mixture of microbes which included the Ames 
strain of Bacillus anthracis. Upon amplification of the bacterial bioagent nucleic acid and the 
combination calibration polynucleotide vector with primer pair no. 350, bacterial bioagent 
identifying amplicons and calibration amplicons were obtained and characterized by mass 
spectrometry. A mass spectrum measured for the amplification reaction is shown in Figure 8). 
The molecular masses of the bioagent identifying amplicons provided the means for 
identification of the bioagent from which they were obtained (Ames strain of Bacillus anthracis) 
and the molecular masses of the calibration amplicons provided the means for their identification 
as well. The relationship between the abundance (peak height) of the calibration amplicon 
signals and the bacterial bioagent identifying amplicon signals provides the means of calculation 
of the copies of the pX02 plasmid of the Ames strain of Bacillus an thracis. Methods of 
calculating quantities of molecules based on internal calibration procedures are well known to 
those of ordinary skill in the art. 

[0137] Averaging the results of 10 repetitions of the experiment described above, enabled a 
calculation that indicated that the quantity of Ames strain of Bacillus anthracis present in the 
sample corresponds to approximately 10 copies of pX02 plasmid. 

[0138] Example 11: Drill-down Genotyping of Campylobacter Species 
[0139] A series of drill-down primers were designed as described in Example 1 with the 
objective of identification of different strains of Campylobacter jejuni. The primers are listed in 
Table 1 1 with the designation "CJST_CJ." Housekeeping genes to which the primers hybridize 
and produce bioagent identifying amplicons include: tkt (transketolase), glyA (serine 
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hydroxymethyltransferase), gltA (citrate synthase), aspA (aspartate ammonia lyase), glnA 
(glutamine synthase), pgm (phosphoglycerate mutase), and uncA (ATP synthetase alpha chain). 
Table 11: Campylobacter Drill-down Primer Pairs 



Pair 
No. 


Forward Primer Name 


Forward Primer 
(SEQ ID NO:) 


Reverse Primer Name 


Reverse Primer 
(SEQ ID NO:) 


Target Gene 


1053 


CJST CJ 1080 1110 F 


102 


CJST CJ 1166 119B R 


456 


gltA 


1064 


CJST CJ 1680 1713 F 


107 


CJST CJ 1795 1822 R 


4 61 


glyJA 


1054 


CJST CJ 2060 2090 F 


109 


CJST CJ 2148 2174 R 


4 63 


pgm 


1049 


CJST CJ 2636 2668 F 


113 


CJST CJ 2753 2777 R 


4 67 


tkt 


1048 


CJST CJ 360 394 F 


119 


CJST CJ 442 476 R 


472 




1047 


CJST CJ 584 616 F 


121 


CJST CJ 663 692 R 


474 





[0140] The primers were used to amplify nucleic acid from 50 food product samples provided by 
the USD A, 25 of which contained Campylobacter jejuni and 25 of which contained 
Campylobacter coli. Primers used in this study were developed primarily for the discrimination 
of Campylobacter jejuni clonal complexes and for distinguishing Campylobacter jejuni from 
Campylobacter coli. Finer discrimination between Campylobacter coli types is also possible by 
using specific primers targeted to loci where closely-related Campylobacter coli isolates 
demonstrate polymorphisms between strains. The conclusions of the comparison of base 
composition analysis with sequence analysis are shown in Tables 12A-C. 
Table 12A — Results of Base Composition Analysis of 50 Campylobacter Samples with Drill- 
down MLST Primer Pair Nos: 1048 and 1047 



MLST type or 
Clonal 
Complex by 

lalysis ° 



Composition of 



(glnA) 

316 T25 



RM4275 
RM4279 



A30 G25 C15 T 



A30 G25 CI 6 



A30 G25 C16 T 
A30 G25 C15 1 



A30 G25 C15 T 



A30 G25 CIS 1 



A30 G2S C15 T47 



A31 G27 C20 T39 
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A31 G27 C20 T39 
A31 G27 C20 T39 



L G27 C20 T39 
1 G27 C20 T39 



A48 G21 CIS T24 
A48 G21 C16 T24 



A48 G21 C16 T24 



1 G27 C20 T39 
1 G27 C20 T39 



1 G27 C20 T39 
I G27 C20 T39 



1 G27 C20 T39 
1 G27 C20 T39 



1 G27 C20 T39 



1 G27 C20 T39 



A48 G21 C16 T24 



8 G21 C16 T24 



148 G21 C16 T24 



148 G21 CIS T24 
8 G21 C16 T24 



L G27 C20 T39 
I G27 C19 T40 



1 G27 C20 T39 



3 G21 CIS T24 



A48 G21 C16 T24 
A48 G21 CIS T24 



A31 G27 C20 T39 
A31 G27 C20 T39 



A48 G21 C16 T24 



Table 12B — Results of Base Composition Analysis of 50 Campylobacter Samples with Drill- 
down MLST Primer Pair Nos: 1053 and 1064 



T type or 
Complex by 



Identifying 



No: 1053 (g-ltA) 

A24 G25 C23 T 



A40 G29 C29 T45 



A24 G25 C23 T47 



A24 G25 C23 T47 
A24 G25 C23 T 



A39 G30 C25 T 



A39 G30 C28 T46 
A39 G30 C28 T46 
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Table 12C - Results of Base Composition Analysis of 50 Campylobacter Samples with Drill- 
down MLST Primer Pair Nos: 1054 and 1049 



Identifying 



A41 G28 C35 T38 



A42 G28 C36 T36 



1 G29 C35 T37 
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A27 G31 C19 T38 



141 G28 C36 T37 



A27 G32 C19 T37 



V42 G28 C3S T37 



A26 G33 C19 T37 



U2 G28 C35 T37 



A27 G30 C19 T39 



5 G28 C32 T36 



A27 G30 C19 T39 



6 G28 C32 T36 



U6 G28 C32 T36 



U6 G28 C32 T36 



G30 C19 T39 



G30C19T39 



G30 C19 T39 



G30 C19 T39 



A46 G28 C32 T36 



G30 C19 T39 



6 G28 C32 T36 



6 G28 C32 T36 



6 G28 C32 T3 6 



6 G20 C32 T36 



6 G28 C32 T36 



6 G28 C32 T36 



6 G28 C32 T36 



5 G29 C32 T36 



G30 C19 T39 



,5 G29 C32 T36 



,5 G29 C32 T36 



[0141] The base composition analysis method was successful in identification of 12 different 
strain groups. Campylobacter jejuni and Campylobacter coli are generally differentiated by all 
loci. Ten clearly differentiated Campylobacter jejuni isolates and 2 major Campylobacter coli 
groups were identified even though the primers were designed for strain typing of 
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Campylobacter jejuni. One isolate (RM4183) which was designated as Campylobacter jejuni 
was found to group with Campylobacter coli and also appears to actually be Campylobacter coli 
by full MLST sequencing. 

[0142] Example 12: Identification of Acinetobacter baumannii Using Broad Range Survey 
and Division- Wide Primers in Epidemiological Surveillance 

[0143] To test the capability of the broad range survey and division- wide primer sets of Table 4 
in identification of Acinetobacter species, 183 clinical samples were obtained from individuals 
participating in, or in contact with individuals participating in Operation Iraqi Freedom 
(including US service personnel, US civilian patients at the Walter Reed Army Institute of 
Research (WRAIR), medical staff, Iraqi civilians and enemy prisoners). In addition, 34 
environmental samples were obtained from hospitals in Iraq, Kuwait, Germany, the United 
States and the USNS Comfort, a hospital ship. 

[0144] Upon amplification of nucleic acid obtained from the clinical samples, primer pairs 346- 
349, 360, 361, 354, 362 and 363 (Table 4) all produced bacterial bioagent amplicons which 
identified Acinetobacter baumannii in 215 of 217 samples. The organism Klebsiella pneumoniae 
was identified in the remaining two samples. In addition, 14 different strain types (containing 
single nucleotide polymorphisms relative to a reference strain of Acinetobacter baumannii) were 
identified and assigned arbitrary numbers from 1 to 14. Strain type 1 was found in 134 of the 
sample isolates and strains 3 and 7 were found in 46 and 9 of the isolates respectively. 

[0145] The epidemiology of strain type 7 of Acinetobacter baumannii was investigated. Strain 7 
was found in 4 patients and 5 environmental samples (from field hospitals in Iraq and Kuwait). 
The index patient infected with strain 7 was a pre-war patient who had a traumatic amputation in 
March of 2003 and was treated at a Kuwaiti hospital. The patient was subsequently transferred to 
a hospital in Germany and then to WRAIR. Two other patients from Kuwait infected with strain 
7 were found to be non-infectious and were not further monitored. The fourth patient was 
diagnosed with a strain 7 infection in September of 2003 at WRAIR. Since the fourth patient was 
not related involved in Operation Iraqi Freedom, it was inferred that the fourth patient was the 
subject of a nosocomial infection acquired at WRA.IR as a result of the spread of strain 7 from 
the index patient. 
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[0146] The epidemiology of strain type 3 of Acinetobacter baumannii was also investigated. 
Strain type 3 was found in 46 samples, all of which were from patients (US service members, 
Iraqi civilians and enemy prisoners) who were treated on the USNS Comfort hospital ship and 
subsequently returned to Iraq or Kuwait. The occurrence of strain type 3 in a single locale may 
provide evidence that at least some of the infections at that locale were a result of a nosocomial 
infections. 

[0147] This example thus illustrates an embodiment of the present invention wherein the 
methods of analysis of" bacterial bioagent identifying amplicons provide the means for 
epidemiological surveillance. 

[0148] Example 13: Selection and Use of MLST Acinetobacter baumanii Drill-down Primers 
[0149] To combine the power of high-throughput mass spectrometric analysis of bioagent 
identifying amplicons -with the sub-species characteristic resolving power provided by multi- 
locus sequence typing (MLST) such as the MLST methods of the MLST Databases at the Max- 
Planck Institute for Infectious Biology (web.mpiib-bcrlin.mpg.de/mlst/dbsMcatarrhalis/ 
documents/primersCatarrhalis_html), an additional 21 primer pairs were selected based on 
analysis of housekeeping genes of the genus Acinetobacter. Genes to which the drill-down 
MLST analogue primers hybridize for production of bacterial bioagent identifying amplicons 
include anthranilate synthase component I (trpE), adenylate kinase (adk:), adenine glycosylase 
(mutY), fumarate hydratase (fumC), and pyrophosphate phospho-hydratase (ppa). These 21 
primer pairs are indicated with reference to sequence listings in Table 1 3. Primer pair numbers 
1 151-1 154 hybridize to and amplify segments of trpE. Primer pair numbers 1 155-1 157 
hybridize to and amplify segments of adk. Primer pair numbers 1 158-1 164 hybridize to and 
amplify segments of mutY. Primer pair numbers 1 165-1 170 hybridize to and amplify segments 
of fumC. Primer pair number 1171 hybridizes to and amplifies a segment of ppa. The primer 
names given in Table 13 indicates the coordinates to which the primers hybridize to a reference 
sequence which comprises a concatenation of the genes TrpE, efp (elongation factor p), adk, 
mutT, fumC, and ppa. For example, the forward primer of primer pair 1 15 1 is named 
AB_MLST- 1 1 -OIF00V_62_9 1_F because it hybridizes to the Acinetobacter MLST primer 
reference sequence of* strain type 1 1 in sample 007 of Operation Iraqi Freedom (OIF) at 
positions 62 to 91. 
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Table 13: MLSX Drill-Down Primers for Identification of Snb-species characteristics 
(Strain Type) of Members of the Bacterial Genus Acinetobacter 



cward Primer Nam 



(SEQ ID HI 



Reverse Primer Name 



B MLST-11-OXFQQ7 62 91 E 



AB MLST-ll-Q] 



U-OXF007 1B5 214 



F007 291 324 B 



B MLST-11-OXF007 260 28£ 



F007 364 393 B 



B MLST-11-OIF007 206 239 E 



AB MLST-11-OIF007 31S 34 



B MLST-11-OXF007 522 552 E 



B MLST-11-OXF007 547 



:F007 71D 736 R 



AB_MLST-11- 

OIF007 1202 X225 F 



AB MLST-11-Q] 



TOO? 1266 1296 R 



TOO? 1299 1316 B 



AB_MLST- 



AB MLST-11-Q] 



FQQ7 1335 1362 R 



,327 X356 F 



AB MLST-11-Q] 



FOOT 1422 1448 R 



.345 X369 F 



:f007 1470 1494 R 



AB_MLST- 
OIF007 



T007 1470 1494 R 



AB_MLST- 
OIF007 
AB_MLST-11- 
OIF007 1792 X826 F 
AB_MLST-11- 
OIF007 1792 X826 F 



TOO? 1470 1494 R 



AB ML5T-11-Q] 



:F007 1790 1821 R 



:F007 1876 1909 R 



:F007 1895 1927 R 



WO 2005/098047 PCT/US2005/005356 
-75- 

| | OIF007 1970 2002 F 1 | | | 

[0150] Analysis of bioagent identifying amplicons obtained using the primers of Table 13 for 
over 200 samples from Operation Iraqi Freedom resulted in the identification of 50 distinct strain 
type clusters. The largest cluster, designated strain type 1 1 (ST1 1) includes 42 sample isolates, 
all of which were obtained from US service personnel and Iraqi civilians treated at the 28 th 
Combat Support Hospital in Baghdad. Several of these individuals were also treated on the 
hospital ship USNS Comfort. These observations are indicative of significant epidemiological 
correlation/linkage. 

[0151] All of the sample isolates were tested against a broad panel of antibiotics to characterize 
their antibiotic resistance profiles. As an example of a representative result from antibiotic 
susceptibility testing, ST1 1 was found to consist of four different clusters of isolates, each with a 
varying degree of sensitivity/resistance to the various antibiotics tested which included 
penicillins, extended spectrum penicillins, cephalosporins, carbipenem, protein synthesis 
inhibitors, nucleic acid synthesis inhibitors, anti-metabolites, and anti-cell membrane antibiotics. 
Thus, the genotyping power of bacterial bioagent identifying amplicons, particularly drill-down 
bacterial bioagent identifying amplicons, has the potential to increase the understanding of the 
transmission of infections in combat casualties, to identify the source of infection in the 
environment, to track hospital transmission of nosocomial infections, and to rapidly characterize 
drug-resistance profiles which enable development of effective infection control measures on a 
time-scale previously not achievable. 

[0152] Various modifications of the invention, in addition to those described herein, will be 
apparent to those skilled in the art from the foregoing description. Such modifications are also 
intended to fall within the scope of the appended claims. Each reference (including, but not 
limited to, journal articles, U.S. and non-U.S . patents, patent application publications, 
international patent application publications, gene bank accession numbers, internet web sites, 
and the like) cited in the present application is incorporated herein by reference in its entirety. 
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WHAT IS CLAIMED IS: 

1 . An oligonucleotide primer selected from the group consisting of: an oligonucleotide 
primer 16 to 35 nucleobases in length comprising 80*% to 100% sequence identity with SEQ ID 
NO: 26, an oligonucleotide primer 20 to 27 nucleobases in length comprising at least a 20 
nucleobase portion of SEQ ID NO: 388, an oligonucleotide primer 22 to 35 nucleobases in 
length comprising SEQ ID NO: 29, an oligonucleotide primer 18 to 35 nucleobases in length 
comprising SEQ ID NO: 391, an oligonucleotide primer 22 to 26 nucleobases in length 
comprising SEQ ID NO: 37, an oligonucleotide primer 20 to 30 nucleobases in length 
comprising SEQ ID NO: 362, an oligonucleotide primer 13 to 35 nucleobases in length 
comprising 70% to 100% sequence identity with SEQ ID NO: 48, an oligonucleotide primer 19 
to 35 nucleobases in length comprising SEQ ID NO: 404, an oligonucleotide primer 21 to 35 
nucleobases in length comprising 70% to 100% sequence identity with SEQ ID NO: 160, an 
oligonucleotide primer 21 to 35 nucleobases in length comprising at least a 16 nucleobase 
portion of SEQ ID NO: 515, an oligonucleotide primer 17 to 35 nucleobases in length 
comprising 70% to 100% sequence identity with SEQ ID NO: 261, an oligonucleotide primer 18 
to 35 nucleobases in length comprising at least a 16 nucleobase portion of SEQ ID NO: 624, an 
oligonucleotide primer 21 to 35 nucleobases in length comprising 70% to 100% sequence 
identity with SEQ ID NO: 231, an oligonucleotide primer 17 to 35 nucleobases in length 
comprising 70% to 100% sequence identity with SEQ ID NO: 591; an oligonucleotide primer 14 
to 35 nucleobases in length comprising 70% to 100%» sequence identity with SEQ ID NO: 349, 
an oligonucleotide primer 17 to 35 nucleobases in length comprising 70% to 100% sequence 
identity with SEQ ID NO: 71 1, an oligonucleotide primer 16 to 35 nucleobases in length 
comprising 70% to 100% sequence identity with SEQ ID NO: 240, an oligonucleotide primer 15 
to 35 nucleobases in length comprising 70% to 100% sequence identity with SEQ ID NO: 596, 
an oligonucleotide primer 16 to 35 nucleobases in length comprising 70% to 100% sequence 
identity with SEQ ID NO: 58, an oligonucleotide primer 21 to 35 nucleobases in length 
comprising at least a 16 nucleobase portion of SEQ ID NO:414, an oligonucleotide primer 16 to 
35 nucleobases in length comprising at least a 16 nucleobase portion of SEQ ID NO: 6, an 
oligonucleotide primer 16 to 35 nucleobases in length comprising at least a 16 nucleobase 
portion of SEQ ID NO:369, an oligonucleotide primer 16 to 35 nucleobases in length comprising 
70% to 100% sequence identity with SEQ ED NO: 24-6, an oligonucleotide primer 19 to 35 
nucleobases in length comprising 70% to 100% sequence identity with SEQ ID NO: 602, an 
oligonucleotide primer 21 to 35 nucleobases in length comprising 70% to 100% sequence 
identity with SEQ ID NO: 256, an oligonucleotide primer 14 to 35 nucleobases in length 
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comprising 70% to 100% sequence identity with SEQ ID NO: 620, an oligonucleotide primer 16 
to 35 nucleobases in length comprising 70% to 100% sequence identity with SEQ ID NO: 344, 
an oligonucleotide primer 18 to 35 nucleobases in length comprising 70% to 100% sequence 
identity with SEQ ID NO: 700, an oligonucleotide primer 16 to 35 nucleobases in length 
comprising 70% to 100% sequence identity with SEQ ID NO: 235, an oligonucleotide primer 16 
to 35 nucleobases in length comprising 70% to 100% sequence identity with SEQ ID NO: 587; 

wherein said primer comprises a non-templated T residue on the 5 '-end, or at least one 
non-template tag. 

2. A composition comprising one or more of the oligonucleotide primers of claim 1 . 

3. A composition comprising two or more of the oligonucleotide pximers of claim 1 . 

4. The composition of claim 3 wherein either or both of said oligonucleotide primers 
comprises at least one modified nucleobase. 

5. The composition of claim 3 wherein either or both of said oligonucleotide primers 
comprises a non-templated T residue on the 5'-end. 

6. The composition of claim 3 wherein either or both of said oligonucleotide primers 
comprises at least one non-template tag. 

7. The composition of claim 3 wherein either or both of said oligonucleotide primers 
comprises at least one molecular mass modifying tag. 

8. An oligonucleotide primer selected from the group consisting or: an oligonucleotide 
primer 16 to 35 nucleobases in length comprising 70% to 100% sequence identity with SEQ ID 
NO: 322, and an oligonucleotide primer 19 to 35 nucleobases in length comprising 70% to 100% 
sequence identity with SEQ ID NO: 686. 

9. A composition comprising one or both of the oligonucleotide primers of claim 8. 

10. The composition of claim 9 wherein either or both of said oligonucleotide primers 
comprises at least one modified nucleobase. 
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11. The composition of claim 9 wherein either or both of said oligonucleotide primers 
comprises a non-templated T residue on the 5'-end. 

12. The composition of claim 9 wherein either or both of said oligonucleotide primers 
comprises at least one non-template tag. 

13. The composition of claim 9 wherein either or both of said oligonucleotide primers 
comprises at least one molecular mass modifying tag. 

14. A kit comprising the composition of claim 3 or claim 9. 

15. The kit of claim 14 further comprising at least one calibration polynucleotide. 

16. The kit of claim 14 further comprising at least one ion exchange resin, linked to 
magnetic beads. 

17. A method for identification of an unknown bacterium comprising: 

amplifying nucleic acid from said bacterium using the composition of claim 3 or claim 9 
to obtain an amplification product; 

determining the molecular mass of said amplification product; 

optionally determining the base composition of said amplification product from said 
molecular mass; and 

comparing said molecular mass or base composition of said amplification product with 
a plurality of molecular masses or base compositions of known bacterial bioagent identifying 
amplicons, wherein a match between said molecular mass or base composition of said 
amplification product and the molecular mass or base composition of a member of said plurality 
of molecular masses or base compositions identifies said unknown bacterium. 

18. The method of claim 17 wherein said molecular mass is determined by mass 
spectrometry. 

19. A method of determining the presence or absence of a bacterium of a particular clade, 
genus, species, or sub-species in a sample comprising: 
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amplifying nucleic acid from said sample using the composition of claim 3 or claim 9 to 
obtain an amplification product; 

determining the molecular mass of said amplification product; 

optionally determining the base composition of said amplification product from said 
molecular mass; and 

comparing said molecular mass or base composition of said amplification product with 
the known molecular masses or base compositions of one or more known clade, genus, species, 
or sub-species bioagent identifying amplicons, wherein a match between said molecular mass or 
base composition of said amplification product and the molecular mass or base composition of 
one or more known clade, genus, species, or sub-species bioagent identifying amplicons 
indicates the presence of said clade, genus, species, or sub-species in said sample. 

20. The method of claim 19 wherein said molecular mass is determined by mass 
spectrometry. 

21. A method for determination of the quantity of an unknown bacterium in a sample 
comprising: 

contacting said sample with the composition of claim 3 or claim 9 and a known quantity 
of a calibration polynucleotide comprising a calibration sequence; 

concurrently amplifying nucleic acid from said bacterium in said sample ~with the 
composition of claim 3 or claim 9 and amplifying nucleic acid from said calibration 
polynucleotide in said sample with the composition of claim 3 or claim 9 to obtakn a first 
amplification product comprising a bacterial bioagent identifying amplicon and a. second 
amplification product comprising a calibration amplicon; 

determining the molecular mass and abundance for said bacterial bioagent identifying 
amplicon and said calibration amplicon; and 

distinguishing said bacterial bioagent identifying amplicon from said calibration 
amplicon based on molecular mass, wherein comparison of bacterial bioagent identifying 
amplicon abundance and calibration amplicon abundance indicates the quantity of bacterium in 
said sample. 

22. The method of claim 21 further comprising determining the base composition of said 
bacterial bioagent identifying amplicon. 
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SEQUENCE LISTING 

<110> Isis Pharmaceuticals, Inc. 
Sampath, Rangarajan 
Hall, Thomas A. 
Ecker, David J. 
Eshoo, Mark. W. 
Massire, Christian 

Science Applications International Corporation 
Larson, Brons M. 
Leighton, Terrance 

<120> COMPOSITIONS FOR USE IN IDENTIFICATION OF BACTERIA 

<130> IBIS0074-500WO (DIBIS-0057WO1) 

<150> 60/545,425 
<151> 2004-02-18 

<150> 60/559,754 
<151> 2004-04-05 

<150> 60/632,862 1 
<151> 2004-12-03 

<150> 60/639,068 
<151> 2004-12-22 

<150> 60/648,188 
<151> 2005-01-28 

<160> 785 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 1 

gtgagatgtt gggttaagtc ccgtaacgag 30 

<210> 2 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 2 

atgttgggtt aagtcccgc 19 

<210> 3 

<211> 25 

<212> DNA 

<213> Artificial Sequence 



1 
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<220> 

<223> Primer 



<400> 3 

atgttgggtt aagtcccgca acgag 



25 



<210> 4 
<211> 22 
<212> DNA 

<213> Art! ficial Sequence 

<220> 

<223> Primer 
<400> 4 

ttaagtcccg- caacgagcgc aa 22 

<210> 5 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Printer 



<210> 6 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Priiaer 
<400> 6 

ttaagtcccg- caacgatcgc aa 22 

<210> 7 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 8 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 8 



<400> 5 

tttaagtccc gcaacgagcg caa 



23 



<400> 7 

tagtcccgca acgagcgc 



18 



2 
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caacgagcgc aaccctt 

<210> 9 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



caagtcatca tggccctta 

<210> 10 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 10 

gctacacacg tgctacaatg 

<210> 11 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 11 

cggattggag tctgcaactc g 

<210> 12 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 12 

aagtcggaat cg-ctagtaat eg 

<210> 13 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 13 

tacggtgaat acgttcccgg g 

<210> 14 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



3 
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<220> 

<223> Primer 



<400> 14 

gccttgtaca cacctcccgt c 



21 



<210> 15 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 15 

cttgtacaca ccgcccgtc 19 

<210> 16 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 17 

<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 17 

tgaacgctgg tggcatgctt aacac 25 

<210> 18 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 19 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 19 



<400> 16 
ttgtacacae 



ic cgcccgtcat ac 



22 



<400> 18 



cactggaact gagacacgg 



19 
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gtggcatgcc taatacatgc aagtcg 

<210> 20 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



tgagtgatga aggccttagg gttgtaaa 

<210> 21 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 21 

taacacatgc aagtcgaacg 

<210> 22 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 22 

ccagcagccg cggtaatac 

<210> 23 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 23 

cggaattact gggcgtaaag 

<210> 24 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 24 

gtgtagcggt gaaatgcg 

<210> 25 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 



<400> 25 

gagagtttga tcctggctca gaacgaa 



27 



<210> 26 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 26 

agaacaccga tggcgaaggc 20 

<210> 27 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 28 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 28 

gggagcaaac aggattagat ac 22 

<210> 29 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 30 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 30 

tggattagag accctggtag tec 23 



<400> 27 

tagaacaccg atggcgaagg c 



21 



<400> 29 

ggattagaga ccctggtagt cc 



22 
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<210> 31 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 31 

ggattagata ccctggtagt ccacgc 26 

<210> 32 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 33 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 33 

gataccctgg tagtccacac eg 22 

<210> 34 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 35 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 35 

accacgccgt aaacgatga 19 

<210> 36 
<211> 18 
<212> DNA 



<400> 32 

tagataccct ggtagtccac gc 



22 



<400> 34 

agagtttgat catggctcag 



20 
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<213> Artificial Sequence 



<220> 

<223> Primer 



<400> 36 

aagcggtgga gcatgtgg 



18 



<210> 37 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 37 

ttcgatgcaa cgcgaagaac ct 22 

<210> 38 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 39 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 39 

acgcgaagaa ccttacc 17 

<210> 40 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 41 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 38 

tttcgatgca acgcgaagaa cct 



23 



<400> 40 

acgcgaagaa ccttacc 



17 



<400> 41 
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gcgaagaacc ttaccaggtc 20 

<210> 42 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 42 
cgaagaacct tacc 

<210> 43 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<400> 43 

tgcgcggaag atgtaacggg 

<210> 44 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<400> 44 

tgcatacaaa cagtcggagc ct 

<210> 45 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 45 

aaactagata acagtagaca tcac 

<210> 46 
<211> 19 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer 
<400> 46 

taccccaaac cgacacagg 19 

<210> 47 
<211> 19 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 47 

ccgtaacttc gggagaagg 

<210> 48 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 48 

ctgacacctg cccggtgc 

<210> 49 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 49 

tctgacacct gcccggtgc 

<210> 50 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 50 

gacgcctgcc cggtgc 



<210> 51 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 51 

acctgcccag tgctggaag 

<210> 52 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Primer 



<400> 52 

gggaactgaa acatctaagt a 



21 



<210> 53 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 53 

ggtggatgcc ttggc 15 

<210> 54 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 55 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 55 

tagaacgtcg cgagacagtt eg 22 

<210> 56 
<211> 18 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer 



<210> 57 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 57 

ctgtccctag tacgagagga cegg 24 



<400> 54 

aaggtactcc ggggataaca ggc 



23 



<400> 56 

gaeagttegg tccctatc 



18 
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<210> 58 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 58 

tctgtcccta gtacgagagg accgg 

<210> 59 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 59 

ctgttcttag tacgagagga cc 

<210> 60 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 60 

tctgttctta gtacgagagg acc 



<210> 61 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 61 

ctagtacgag aggaccgg 

<210> 62 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



tagtacgaga ggaccgg 

<210> 63 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
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<220> ' 

<223> Primer 



<400> 63 

ggggagtgaa agagatcctg aaaccg 



26 



<210> 64 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 64 

cgagagggaa acaacccaga cc 22 

<210> 65 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 66 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 66 

ttgtagcaca gcaaggcaaa tttcctgaaa c 31 

<210> 67 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 68 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 68 

tcgtgattat ggatggcaac gtgaa 25 



<400> 65 

tcgtgcccgc aatttgcata aagc 



24 



<400> 67 

taggtttacg tcagtatggc gtgattatgg 



30 
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<210> 69 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 69 

ttatggatgg caacgtgaaa cgcgt 25 

<210> 70 
<211> 26 
<212> DNA 

<213> Artificial Sequence 

<22 0> 

<223> Primer 



<210> 71 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 71 

tactagcggt aagcttaaac aagattgc 28 

<210> 72 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 73 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 73 

tcggcgaaat ccgtattcct gaaaatga 28 

<210> 74 
<211> 27 

<212> DNA 



<400> 70 

tctttgccat tgaagatgac ttaagc 



26 



<400> 72 

ttgccaatga tattcgttgg ttagcaag 



28 
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<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 74 

taccactatt aatgtcgctg gtgcttc 27 

<210> 75 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 75 

ttataactta ctgcaatcta ttcagttgct tggtg 35 



<210> 76 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 76 

tattgtttca aatgtacaag gtgaagtgcg 

<210> 77 
<211> 33 
<212> DNA 



30 



<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 77 

tggttatgta ccaaatactt tgtctgaaga tgg 33 

<210> 78 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 78 

tgaagtgcgt gatgatatcg atgcacttga tgta 34 

<210> 79 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 79 
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tggaacgtta tcaggtgccc caaaaattcg 



30 



<210> 80 
<211> 31 
<212> DNA 

<213> Artificial Sequence 

<220> | 
<223> Primer 

<400> 80 

tcggtttagt aaaagaacgt attgctcaac c 31 



<210> 81 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 82 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 82 

tcaagcagaa gctttggaag aagaagg 27 

<210> 83 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 84 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Primer 
<400> 84 

ttgcttaaag ttggttttat tggttggcg 29 

<210> 85 
<211> 34 
<212> DNA 



<400> 81 

tcaacctgac tgcgtgaatg gttgt 



25 



<400> 83 

tgagattgct gaacatttaa tgctgattga 



30 



1 6 
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<213> Artificial Sequence 



<220> 

<223> Primer 



<400> 85 

tcagttttaa tgtctcgtat gatcgaatca aaag 



34 



<210> 86 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 86 

gcacaacctg cggctgcg 18 

<210> 87 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 88 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 88 

tatggctcta ctcaa 15 

<210> 89 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 90 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 37 

tctagtaata ataggaccct cage 



24 



<400> 89 

tgagtcactt gaagttgata caaatcctct 



30 
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<400> 90 

gaatagcaat taatccaaat 



20 



<210> 91 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 91 

tcagttccgt tatcgccatt gcat 24 

<210> 92 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 93 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 93 

tcactcttac atataaggaa ggcgctc 27 

<210> 94 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 95 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 95 

gttatttagc actcgttttt aatcagcc 28 



<400> 92 

tggaactatt gcaactgcta atg 



23 



<400> 94 

tcaggatgga aataaccacc aattcactac 



30 



<210> 96 
<211> 20 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 96 

actcgttttt aatcagcccg 20 

<210> 97 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 97 

gattattgtt atcctgttat gccatttgag 30 

<210> 98 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 98 

tgattattgt tatcctgtta tgccatttga g 31 

<210> 99 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 99 

ttattgttat cctgttatgc c 

<210> 100 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 100 

gttatcctgt tatgccattt g 

<210> 101 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<400> 101 

ccgtggtatt ggagttattg 

<210> 102 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 102 

ttgagggtat gcaccgtctt tttgattctt t 

<210> 103 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 103 

agttataaac acggctttcc tatggcttat cc 

<210> 104 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 104 

tggcttatcc aaatttagat cgtggtttta c 

<210> 105 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 105 

ttatcgtttg tggagctagt gcttatgc 

<210> 106 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 106 

tgctcgagtg attgactttg ctaaatttag aga 
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<210> 107 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 107 

tgattttgct aaatttagag aaattgcgga tgaa 34 



<210> 108 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 108 

tcccaattaa ttctgccatt tttccaggta t 

<210> 109 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 109 

tcccggactt aatatcaatg aaaattgtgg a 

<210> 110 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 110 

tgcggatcgt ttggtggttg tagatgaaaa 



31 



<210> 111 

<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 111 

tcgtttggtg gtggtagatg aaaaagg 27 

<210> 112 
<211> 28 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 112 

tagatgaaaa gggcgaagtg gctaatgg 

<210> 113 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 113 

tgcctagaag atcttaaaaa tttccgccaa ctt 

<210> 114 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 114 

tccccaggac accctgaaat ttcaac 

<210> 115 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



tggcatttct tatgaagctt gttctttagc a 

<210> 116 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 116 

tgaagcttgt tctttagcag gacttca 

<210> 117 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



22 



WO 2005/098047 



PCT/US2005/005356 



<400> 117 

tttgatttta cgccgtcctc caggtcg 

<210> 118 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 118 

tcctgttatc cctgaagtag ttaatcaagt ttgt 

<210> 119 
<211> 35 
<212> DNA 

<213> Artificial Sequence 



<400> 119 

tcctgttatc cctgaagtag ttaatcaagt ttgtt 

<210> 120 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 120 

taggcgaaga tatacaaaga gtattagaag ctaga 

<210> 121 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 121 

tccaggacaa atgtatgaaa aatgtccaag aag 

<210> 122 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 122 

tgaaaaatgt ccaagaagca tagcaaaaaa agca 

<210> 123 
<211> 26 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 123 

tcttatgcca agaggacaga gtgagt 

<210> 124 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 124 

tgtattaggg gcatacagtc ctcatcc 

<210> 125 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 125 

gaaagagttc ggattggg 

<210> 126 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 126 

acaacgaagt acaatacaag ac 

<210> 127 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 127 

cgaagtacaa tacaagacaa aagaagg 

<210> 128 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 128 

tcgaagtaca atacaagaca aaagaagg 

<210> 129 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 129 

acaatacaag acaaaagaag g 

<210> 130 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 130 

caggtttagt accagaacat gcag 

<210> 131 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 131 

ggtttagtac cagaacatgc 

<210> 132 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 132 

cggcgtactt caacgacagc ca 



<210> 133 
<211> 32 
<212> DNA 

<213> Artificial Sequence 

<220> 

<22 3> Primer 
<400> 133 

ttatcagcta gaccttttag gtaaagctaa gc 

<210> 134 
<211> 32 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 134 

tccaaggtac actaaactta cttgagctaa tg 32 

<210> 135 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 135 

tcaaaaagcc ctaggtaaag agattccata tc 32 

<210> 136 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 136 

tccgttctta caaatagcaa tagaacttga age 33 

<210> 137 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 137 

tggagcttga agetatcget cttaaagatg 30 

<210> 138 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 138 

tggaacttga agctctcgct cttaaagatg 30 

<210> 139 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 139 

tgggacttga agctatcgct cttaaagatg 

<210> 140 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 140 

tcttctcatc ctatggctat tatgcttgc 

<210> 141 
<211> 24 
<212> DNA 



30 



29 



<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 141 

ggtgaaagaa gttgcctcta aagc 24 

<210> 142 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 142 

atggacaagg ttggcaagga agg 23 



<210> 143 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 143 

aaggaaggcg tgatcaccgt tgaaga 2 6 

<210> 144 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 144 

tggaagatct gggtcaggc 19 
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<210> 145 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 145 

tctgcccgtg tcgttggtga 



<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 146 

tccattgttc gtatggctca agact 

<210> 147 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



tcaggtggct tacacggcgt ag 

<210> 148 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 148 

tctttcttga atgctggtgt acgtatcg 

<210> 149 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 149 

tcaacgaagg taaaaaccat ctcaacg 

<210> 150 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 
<400> ISO 

tgttcgctgt ttcacaaaca acattcca 

<210> 151 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 151 

tacttacttg agaatccaca agctgcaa 

<210> 152 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 152 

tggcgaacct ggtgaacgaa gc 

<210> 153 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 153 

tagttgctca aacagctggg ct 

<210> 154 
<211> 28 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 154 

tcccggagct tttatgacta aagcagat 

<210> 155 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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tcgccgtgga aaaatcctac get 

<210> 156 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 156 

ttcctgaccg acccattatt ccctttatc 

<210> 157 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 157 

tcctgaccga cccattattc cctttatc 

<210> 158 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 158 

gtcgtgaaaa cgagctggaa ga 

<210> 159 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 159 

tgcgtttacc geaatgegtg c 

<210> 160 
<211> 29 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 160 

tgctcgtggt gcacaagtaa eggatatta 

<210> 161 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 
<400> 161 

ttgctcgtgg tgcacaagta acggatatta 

<210> 162 
<211> 26 
<212> DNA 



30 



<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 162 

cgtcagggta aattccgtga agttaa 2 6 

<210> 163 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 163 

tggtaacaga gccttatagg cgca 24 

<210> 164 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 164 

tggctccttg gtatgactct gcttc 25 

<210> 165 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 165 

tgctgaggcc tggaccgatt atttac 26 

<210> 166 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 166 

ttatttacct gcactcccac aactg 25 
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<210> 167 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<22 0> 

<223> Primer 
<400> 167 

tccttgaccg cctttccgat ac 



<210> 168 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 168 

tgaggaccgt gtcgcgctca 

<210> 169 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 169 

tcagaccatg ctcgcagaga aactt 

<210> 170 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 170 

tcagtatgta tccaccgtag ccagtc 

<210> 171 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 171 

tgggtgacat tcatcaattt catcgttc 

<210> 172 
<211> 17 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 172 

tcaagaagaa aaagagc 17 

<210> 173 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 173 

caagaagaaa aagagcttct aaaaagaata c 31 

<210> 174 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 174 

agcttttgca tattatatcg agccac 26 



<210> 175 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 175 

tagcttttgc atattatatc gagccac 

<210> 176 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 176 

cttttgcata ttatatcgag c 

<210> 177 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 177 

tttacagctt tatgcaccg 

<210> 178 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 178 

caacggatgc tggcaag 

<210> 179 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 17 9 

tgtagccg-ct aagcactacc atcc 

<210> 18 0 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 180 

tggacggcat cacgattctc tac 

<210> 181 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 181 

tgaagtagaa atgactgaac gtccga 

<210> 182 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 182 

taaaacaaac tacggtaaca ttgatcgca 
<210> 183 
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<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 183 

tcaggtactg ctatccaccc tcaa 

<210> 184 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<400> 184 

tgtactgcta tccacoctca a 

<210> 185 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 185 
tccaccctca a 

<210> 186 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 186 

tcaccaggtt caacstcaaaa aatattaaca 

<210> 187 
<211> 27 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 187 

ttacacatat cgtgagcaat gaactga 

<210> 188 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Primer 
<400> 188 

ttactccatt attgcttggt tacactttcc 

<210> 189 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



tacacaacaa tggcggtaaa gatgg 

<210> 190 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 190 

tgcgcagctc ttggtatcga gtt 

<210> 191 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 191 

tgcctcgaag ctgaatataa ccaag-tt 

<210> 192 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 192 

tcaacggtaa cttctatgtt acttctg 

<210> 193 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 193 

tcaagccgta cgtattatta ggtgctg 
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<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 194 

tccgtacgta ttattaggtg ctggtca 

<210> 195 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 195 

tcgtacgtat tattaggtgc tggtcact 

<210> 196 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 196 

tgttggtgct ttctggcgct taa 

<210> 197 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 197 

tggtgctttc tggcgcttaa acga 

<210> 198 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<22 0> 

<223> Primer 
<400> 198 

tctactgatt ttggtaatct tgcagcacag 

<210> 199 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 199 

tgcaagtggt acttcaacat gggg 

<210> 200 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 200 

ttacaggaag tttaggtggt aatctaaaag g 

<210> 201 
<211> 20 
<212> DNA 



24 



31 



<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 201 

cagaatcaag ttcccagggg 20 

<210> 202 
<211> 23 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer 
<400> 202 

agaatcaagt tcccaggggt tac 23 

<210> 203 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 203 

aatctgctat ttggtcagg 19 

<210> 204 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 204 

gaaggatata cggttgatgt c 21 
<210> 205 
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<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 205 

tcctgaaaaa tggagcacgg 

<210> 206 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 206 

tggagcacgg cttctgatc 



<210> 207 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 207 

ggctcagcca tttagttacc gctat 

<210> 208 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



tcagcgcgta cagtgggtga t 

<210> 209 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



<400> 209 

tggtgactcg gcatgttatg aagc 

<210> 210 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 
<400> 210 

ttataccgga aacttcccga aaggag 

<210> 211 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



tgacatccgg ctcacgttat tatggt 

<210> 212 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 212 

tccggctcac gttattatgg tac 

<210> 213 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 213 

tgcaaaggag gtactcagac cat 

<210> 214 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 214 

tgacatgctt gtccgttcag gc 

<210> 215 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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tggtacatgt gccttcattg atgctg 

<210> 216 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 216 

tggcacggcc atctccgtg 

<210> 217 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 217 

tgcgggtagg gagcttgagc 

<210> 218 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 218 

tcctagagga atggctgcca eg 

<210> 219 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 219 

taccccaggg aaagtgccac aga 

<210> 220 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 220 

taaaccccat egggagcaag accgaata 

<210> 221 
<211> 19 
<212> DNA 
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<213> Artificial Sequence 



<220> 

<223> Primer 



<400> 221 

gaggaaagtc catgctcgc 



<210> 222 
<211> 29 



<212> DNA 

<213> Artificial Sequence 



<400> 222 

taaggatagt gcaacagaga tataccgcc 

<210> 223 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 223 
gaggaaagtc cgggctc 

<210> 224 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 224 

tctaaatggt cgtgcagttg cgtg 

<210> 225 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 225 

tggtaagagc gcaccggtaa gttggtaaca 

<210> 226 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



42 



WO 2005/098047 



PCT/US2005/005356 



<400> 226 

taagagcgca ccggtaagtt gg 



22 



<210> 227 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 227 

tgcataccgg taagttggca aca 23 

<210> 228 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 228 

tccaccaaga gcaagatcaa ataggc 26 

<210> 229 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 230 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 230 

tccgcggagt tgactgggt 19 

<210> 231 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 231 

gacctacagt aagaggttct gtaatgaacc 30 

<210> 232 
<211> 31 
<212> DNA 



<400> 229 

gaggaaagtc catgctcac 



19 
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<213> Artifi 



tl Sequence 



<220> 

<223> Primer 



<400> 232 

tgacctacag taagaggttc tgtaatgaac c 



31 



<210> 233 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 233 

tgtaatgaac cctaatgacc atccacacgg 30 

<210> 234 
<211> 30 
<212> DNA 

<213> Artificial Sequence 



<210> 235 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 235 

catccacacg gtggtggtga agg 23 

<210> 236 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 236 

tcatccacac ggtggtggtg aagg 24 

<210> 237 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer 



<220> 
<223> Pri: 



<400> 234 

taatgaaccc taatgaccat ccacacggtg 



30 
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<400> 237 

tccacacggt ggtggtgaag g 

<210> 238 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 238 

gaccacctcg gcaaccgt 



<210> 239 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 239 

tcagctgtcg cagttcatgg acc 

<210> 240 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 240 

tatcgctcag gcgaactcca ac 

<210> 241 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 241 

ttatcgctca ggcgaactcc aac 

<210> 242 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 242 

tcgttcctgg aacacgatga cgc 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 243 

tcaacaacct cttggaggta aagctcagt 

<210> 244 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Primer 
<400> 244 

cttggaggta agtctcattt tggtgggca 

<210> 245 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



29 



29 



<220> 

<223> Primer 
<400> 245 

tgggcagcgt ttcggcgaaa tgga 24 

<210> 246 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 246 

gggcagcgtt tcggcgaaat gga 23 

<210> 247 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 247 

cagcgtttcg gcgaaatgga 20 

<210> 248 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 248 

caaaacttat taggtaagcg tgttgact 

<210> 249 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 249 

cgtgttgact attcggggcg ttcag 

<210> 250 
<211> 27 
<212> DNA 

<213> Artificial Sequence 



<400> 250 

taagaagccg gaaaccatca actaccg 

<210> 251 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 251 

acccagtgct gctgaaccgt gc 

<210> 252 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 252 

cgccgacttc gacggtgacc 

<210> 253 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 253 

tcgccgactt cgacggtgac c 
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<210> 254 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 254 

tggcccgaaa gaagctgagc g 

<210> 255 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 255 

tcaggagtcg ttcaactcga tctacatgat g 

<210> 256 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 256 

caggagtcgt tcaactcgat ctacatgat 

<210> 257 
<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 257 

tcaggagtcg ttcaactcga tctacatgat 

<210> 258 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 258 

tgattccggt gcccgtggt 

<210> 259 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
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<400> 259 

tgattctggt gcccgtggt 

<210> 260 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 260 

cttgctggta tgcgtggtct gatg 

<210> 261 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<220> , 
<223> Primer 

<400> 261 

ctggcaggta tgcgtggtct gatg 

<210> 262 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 262 

tctggcaggt atgcgtggtc tgatg 

<210> 263 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 263 

tggtatgcgt ggtctgatgg c 

<210> 264 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 264 

tgctcgtaag ggtctggcgg atac 
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<210> 265 
<211> 26 
<212> DNA. 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 265 

cgtcgtgtaa ttaaccgtaa caaccg 

<210> 266 
<211> 26 
<212> DNA, 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 266 

cgtcgggtga ttaaccgtaa caaccg 

<210> 267 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 267 

tattggacaa cggtcgtcgc gg 

<210> 268 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 268 

tctggataac ggtcgtcgcg g 

<210> 269 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 269 

caaaggtaag caaggacgtt tccgtca 



<210> 270 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 
<400> 270 

caaaggtaag caaggtcgtt tccgtca 27 



<210> 271 
<211> 29 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 271 

aaccttaatt ggaaagaaac ccaagaagt 

<210> 272 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 272 

taaccttaat tggaaagaaa cccaagaagt 

<210> 273 
<211> 26 
<212> DNA 



29 



30 



<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 273 

caataccgca acagcggtgg cttggg 26 

<210> 274 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 274 

tcaataccgc aacagcggtg gcttggg 27 

<210> 275 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 275 
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gctggtgaaa ataacccaga tgtcgtc-ttc 

<210> 276 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 276 

tgctggtgaa aataacccag atgtcgtctt c 

<210> 277 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 277 

cgcaaaaaaa tccagctatt age 

<210> 278 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 278 

tegcaaaaaa atccagctat tagc 

<210> 279 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 279 

cgagtatagc taaaaaaata gtttatgaca 

<210> 280 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 280 

tcgagtatag ctaaaaaaat agtttatgac a 

<210> 281 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 
<400> 281 

cctatattaa tcgtttacag aaactggct 2 9 



<210> 282 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 282 

tcctatatta atcgtttaca gaaactggct 

<210> 283 
<211> 23 
<212> DNA 



30 



<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 283 

ctggctaaaa ctttggcaac ggt 23 

<210> 284 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 284 

tctggctaaa actttggcaa cggt 24 

<210> 285 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 285 

atgattacaa ttcaagaagg tcgtcacgc 29 

<210> 286 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 286 

tatgattaca attcaagaag gtcgtcacgc 

<210> 287 
<211> 25 
<212> DNA 



30 



<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 287 

taacggttat catggcccag atggg 25 

<210> 288 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 288 

ttaacggtta tcatggccca gatggg 2 6 

<210> 289 

<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 289 

agcaggtggt gaaatcggcc acatgatt 2 8 

<210> 290 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 290 

tagcaggtgg tgaaatcggc cacatgatt 2 9 

<210> 291 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 291 

cagagaccgt tttatcctat cage 24 
<210> 292 
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<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 292 

tcagagaccg ttttatccta tcagc 

<210> 293 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 293 

tctaaaacac caggtcaccc agaag 

<210> 294 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 294 

ttctaaaaca ccaggtcacc cagaag 

<210> 295 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 295 

atggccatgg cagaagctca 

<210> 296 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<400> 296 

tatggccatg gcagaagctc a 

<210> 297 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 297 

cttgtacttg tggctcacac ggctgtttgg 

<210> 298 
<211> 31 
<212> DNA 

<213> Artificial Sequence 



<400> 298 

tcttgtactt gtggctcaca cggctgtttg g 

<210> 299 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 299 

accatgacag aaggcatttt gaca 

<210> 300 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 300 

taccatgaca gaaggcattt tgaca 

<210> 301 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 301 

gatgactttt tagctaatgg tcaggcagc 

<210> 302 
<211> 30 
<212> DNA 

<213> Artificial Sequence 



<400> 302 

tgatgacttt ttagctaatg gtcaggcagc 



<210> 303 
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<211> 20 
<212> DNA 

<213> Artificial Sequence 
<22 0> 

<223> Primer 
<400> 303 

tagctaatgg tcaggcagcc 20 

<210> 304 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 304 

gtcaaagtgg cacgtttact ggc 23 



<210> 305 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 305 

tgtcaaagtg gcacgtttac tggc 

<210> 306 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 306 

agcgtaaagg tgaacctt 

<210> 307 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



tagcgtaaag gtgaacctt 

<210> 308 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 
<400> 308 

gcttcaggaa tcaatgatgg agcag 

<210> 309 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 309 

tgcttcagga atcaatgatg gagcag 

<210> 310 
<211> 30 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer 
<400> 310 

ggggattcag ccatcaaagc agctattgac 

<210> 311 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 311 

tggggattca gccatcaaag cagctattga c 

<210> 312 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



30 



31 



<220> 

<223> Primer 
<400> 312 

tcagccatca aagcagctat tg 22 

<210> 313 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 313 
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ccttacttcg aactatgaat cttttggaag 

<210> 314 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 314 

tccttacttc gaactatgaa tcttttggaa g 

<210> 315 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 315 

ggggattgat atcaccgata agaagaa 

<210> 316 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 316 

tggggattga tatcaccgat aagaagaa 

<210> 317 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 317 

tcgccaatca aaactaaggg aatggc 

<210> 318 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 318 

ttcgccaatc aaaactaagg gaatggc 

<210> 319 
<211> 29 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 319 

gggcaacagc agcggattgc gattgcgcg 

<210> 320 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 320 

tgggcaacag cagcggattg cgattgcgcg 

<210> 321 
<211> 24 
<212> DNA 



29 



30 



<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 321 

tcaagcaaac gcacaatcag aagc 24 

<210> 322 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 322 

caagcaaacg cacaatcaga age 23 

<210> 323 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 323 

aacgeacaat cagaagc ^7 

<210> 324 

<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<40O> 324 

tgcacaatca gaagctaaga aagcgcaagc t 

<210> 325 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<40O> 325 

tgcaagcttc tggtgctagc att 

<210> 326 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<40O> 326 

tgcttctggt gctagcatt 

<210> 327 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<40O> 327 
tggtgctagc att 

<210> 328 
<21L> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<40O> 328 

tgctagttat ggtacagagt ttgcgac 

<210> 329 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<40O> 329 

tggtacagag tttgcgac 

<210> 330 
<211> 15 
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<212> DNA 

<213> Artificial Sequence 



<400> 330 
tacagagttt gcgac 

<210> 331 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 331 

tcgattaggc agcaacgaaa gccg 

<210> 332 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 332 

tcgacctttg gcaggaacta gac 

<210> 333 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 333 

tcaaatgtac aaggtgaagt gcgtga 

<210> 334 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 334 

tggatggcat ggtgaaatgg atatgtc 



<210> 335 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 335 

atgtcgattg caatccgtac ttgtg 

<210> 336 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 336 

gtgcatgcgg atacagagca gag 

<210> 337 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 337 

tgcaagcgcg accacatacg 

<210> 338 
<211> 27 
<212> DNA 

<213> Artificial Sequence 



<400> 338 

gcactatgca cacgtagatt gtcctgg 

<210> 339 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 339 

ttgactgccc aggtcacgct g 



<210> 340 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 340 

tagactgccc aggacacgct g 
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<210> 341 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 341 

tgcacgccga ctatgttaag aacatgat 

<210> 342 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 342 

tgatcactgg tgctgctcag atgga 

<210> 343 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 343 

aagacgacct gcacgggc 

<210> 344 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 344 

ccacacgccg ttcttcaaca act 

<210> 345 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer 
<400> 345 

tccaeacgcc gttcttcaac aact 

<210> 346 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 
<400> 346 

aactaccgtc ctcagttcta cttcc 

<210> 347 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 347 

aactaccgtc cgcagttcta cttcc 

<210> 348 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 348 

ccacagttct acttccgtac tactgacg 

<210> 349 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 349 

cgtggcggcg tggttatcga 

<210> 350 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 350 

-tcgtggcggc gtggttatcg a 

<210> 351 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 351 

-fcatgctgacc gaccagtggt acgt 



25 
28 
20 
21 
24 
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<210> 352 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 352 

cgacgcgctg cgcttcac 

<210> 353 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 353 

cttctgcaac aagctgtgga acgc 

<210> 354 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 354 

accgagcaag gagaccagc 

<210> 355 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 355 

tcttgc-tctt tcgtgagttc agtaaat< 

<210> 356 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 356 

tcgatc-tggt ttcatgctgt ttcagt 

<210> 357 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 
<400> 357 

ttactcaccc gtccgccgct 

<210> 358 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 358 

tgttactcac ccgtctgcca ct 



<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 359 

ttactcaccc gtccgcc 

<210> 360 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 360 

acaaccatgc accacctgtc 

<210> 361 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 361 

tacgcattac tcacccgtcc gc 

<210> 362 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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acgagctgac gacagccatg 

<210> 363 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<400> 363 

tacgagctga cgacagccat g 

<210> 364 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 364 

acgacacgag ctgacgac 

<210> 365 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 365 
acacgagctg ac 

<210> 366 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 366 
tccccacctt cctcc 



<210> 367 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 367 

gacgtcatcc ccaccttcct cc 

<210> 368 
<211> 21 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 368 

gacgtcatcc ccaccttcct c 

<210> 369 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<400> 369 

tgacgtcatc cccaccttcc tc 

<210> 370 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 370 

ttgacgtcat ccccaccttc etc 

<210> 371 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 371 

ttgacgtcat ccccaccttc etc 

<210> 372 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 372 

tgacgtcatg gccaccttcc 

<210> 373 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 373 

tgacgtcatg cccaccttcc 

<210> 374 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 374 

tgacgtcatc cccaccttcc 



<210> 375 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 375 

attgtagcac gtgtgtagcc c 

<210> 376 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 376 

cgagttgcag actgcgatcc g 

<210> 377 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 377 

cgagttgcag actgcgatcc g 

<210> 378 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 378 

gacgggcggt gtgtacaag 



<210> 379 
<211> 22 
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<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 379 

accttgttac gacttcaccc ca 

<210> 380 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer 
<400> 380 

ccttgttacg acttcacccc 

<210> 381 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 381 

cacggctacc ttgttacgac 

<210> 382 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 382 

. tccagcc 



<210> 383 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 383 

actgctgcct cccgtag 

<210> 384 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Primer 
<400> 384 

cggctgctgg cacgaagtta g 

<210> 385 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 385 

ctttacgccc agtaattccg 

<210> 386 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 386 

cgcatttcac cgctacac 

<210> 387 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 387 

gtatctaatc ctgtttgctc cc 

<210> 388 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 388 

cgtggactac cagggtatct a 

<210> 389 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



tcgtggacta ccagggtatc ta 
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<210> 390 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 390 
cgtactcccc aggcg 

<210> 391 
<211> 18 
<212> DNA 

<213> Artificial Sequence 



<400> 391 

ggccgtactc cccaggcg 

<210> 392 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 392 

tggccgtact ccccaggcg 

<210> 393 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 393 

gcgaccgtac tccccagg 



<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 394 

gccttgcgac cgtactccc 

<210> 395 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 
<400> 395 

cccccgtcaa ttcctttgag t 

<210> 396 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 396 

ggtaaggttc ttcgcgttg 

<210> 397 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 397 

tcgcaggctt acagaacgct ctccta 

<210> 398 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



tcggactcgc tttcgctacg 



<210> 399 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 399 

gtgcgccctt tctaactt 

<210> 400 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 400 

tggctgcttc taagccaac 
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<210> 401 
<211> 16 
<212> DNA 

<213> Artificial Sequence 



<400> 401 
gggtttcccc attcgg 

<210> 402 
<211> 18 
<212> DNA 

<213> Artificial Sequence 



<400> 402 

ccttctcccg aagttacg 

<210> 403 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



caccgggcag gcgtc 

<210> 404 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 404 

gaccgttata gttacggcc 

<210> 405 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 405 

tgaccgttat agttacggcc 

<210> 406 
<211> 18 
<212> DNA 

<213> Artificial Sequence 



75 



WO 2005/098047 



PCT/US2005/005356 



<220> 

<223> Primer 
<400> 406 

tcgctacctt aggaccgt 

<210> 407 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 407 

ccgacaagga atttcgctac c 

<210> 408 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 408 
ttcgctcgcc gctac 

<210> 409 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 409 

agccgacatc gaggtgccaa ac 

<210> 410 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 410 

ccggtcctct cgtacta 

<210> 411 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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agtccatccc ggtcctctcg 

<210> 412 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 412 

ttagatgctt tcagcactta tc 

<210> 413 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 413 

acttagatgc tttcagcggt 

<210> 414 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



tgcttagatg ctttcagc 

<210> 415 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<220> 

<2 23> Primer 
<400> 415 

ttcgtgctta gatgctttca g 21 

<210> 416 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<2 20> 

<223> Primer 
<400> 416 

tttcgtgctt agatgctttc ag 22 



<210> 417 
<211> 24 
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<212> DNA 

<213> Artificial Sequence 



<400> 417 

gtttcatgct tagatgcttt cage 

<210> 418 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 418 

acaaaaggca cgccatcacc c 

<210> 419 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 419 

acaaaaggta cgccgtcacc c 

<210> 420 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 420 

taatgccggg tagtgeaate cattcttcta g 

<210> 421 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 421 

tgcacctgcg gtcgagcg 

<210> 422 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 422 

tgccatccat aatcacgcca tactgacg 

<210> 423 
<211> 27 
<212> DNA 

<213> Artificial Sequence 



<400> 423 

tgccagtttc cacatttcac gttcgtg 

<210> 424 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 424 

tcgcttgagt gtagtcatga ttgcg 

<210> 425 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 425 

tgagtcgggt tcactttacc tggca 

<210> 426 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 426 

ttgtacattt gaaacaatat gcatgacatg tgaat 

<210> 427 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 427 

taccggaagc accagcgaca ttaatag 

<210> 428 
<211> 32 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 428 

tgcaactgaa tagattgcag taagttataa gc 

<210> 429 
<211> 34 
<212> DNA 



32 



<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 429 

tgaattatgc aagaagtgat caattttctc acga 34 

<210> 430 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 430 

tgccgtaact aacataagag aattatgcaa gaa 33 



<210> 431 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 431 

tgacggcatc gataccaccg tc 22 

<210> 432 
<211> 34 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 432 

tcacaggttc tacttcatca ataatttcca ttgc 34 

<210> 433 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<4O0> 433 

tccgccaaaa actccccttt tcacagg 

<2L0> 434 
<2L1> 30 
<2L2> DNA 

<2L3> Artificial Sequence 
<22 0> 

<22 3> Primer 
<40 0> 434 

ttg-caatcga catatccatt tcaccatgcc 

<21 0> 435 
<21 1> 24 
<212> DNA 

<21 3> Artificial Sequence 
<22 0> 

<22 3> Primer 
<40 0> 435 

ttc-tgcttga ggaatagtgc gtgg 

<21 0> 436 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<22 0> 

<22 3> Primer 

<40O> 436 

tacgttctac gatttcttca tcaggtacat c 

<210> 437 
<?.:.1> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Primer 
<40O> 437 

tacaacgtga taaacacgac cagaagc 

<210> 438 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<40O> 438 

tccatattgt tgcataaaac ctgttggc 
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<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 439 

tcjagatgtcg aaaaaaacgt tggcaaaata c 

<210> 440 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 
<400> 440 

acggcacgag gtagtcgc 18 

<210> 441 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> Primer 
<400> 441 

taaccatttc gcgtaagatt caa 23 

<210> 442 
<211> 28 
<2 12> DNA 

<2 13> Artificial Sequence 
<220> 

<2 23> Primer 
<4 O0> 442 

tcatgtgcta atgttactgc tggatctg 28 

<2 10> 443 
<2 11> 14 
<2 12> DNA 

<2 13> Artificial Sequence 
<2 20> 

<2 23> Primer 
<4 O0> 443 

tg-ttactgct ggat 14 

<2 10> 444 
<2 11> 17 
<2 12> DNA 

<2 23> Artificial Sequence 
<220> 
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<223> Primer 



<210> 445 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<400> 445 

tgcgggctgg ttcaacaaga g 

<210> 446 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 446 

tgatgcgggc tggttcaac 

<210> 447 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 447 

tcctgtttta tagccgccaa gagtaag 

<210> 448 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 448 

tcaaggttct caccgtttac cttaggag 

<210> 449 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 449 

tgaatcttga aacaccatac <gtaacg 
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<211> 21 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer 
<400> 450 

tgaatcttga aacaccatac g 

<210> 451 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 451 

gtaacccttg tctttgaatt gtatttgc 

<210> 452 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 452 

tgtaaccctt gtctttgaat tgtatttgc 

<210> 453 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 453 

ggtaaccctt gtctttgaat 

<210> 454 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 454 

tggtaaccct tgtctttg 

<210> 455 
<211> 34 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Primer 
<400> 455 

tcccttattt ttctttctac taccttcgga taat 34 



<210> 456 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 456 

tcccctcatg tttaaatgat caggataaaa age 

<210> 457 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 457 

tcggtttaag ctctacatga tegtaaggat a 

<210> 458 
<211> 28 
<212> DNA 

<213> Artificial Sequence 



33 



31 



<220> 

<223> Primer 
<400> 458 

tttgctcatg atetgeatga agcataaa 28 

<210> 459 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 459 

tgcaatgtgt gctatgtcag caaaaagat 29 

<210> 460 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 460 

tgagcgtgtg gaaaaggact tggatg 26 
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<210> 461 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 461 

tatgtgtagt tgagcttact acatgagc 

<210> 462 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 462 

tggttcttac ttgctttgca taaactttcc a 



<210> 463 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 463 

tcgatccgca tcaccatcaa aagcaaa 

<210> 464 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 464 

tccacactgg attgtaattt accttgttct tt 

<210> 465 
<211> 31 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 465 

tctctttcaa agcaccattg ctcattatag t 

<210> 466 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 
<400> 466 

tgaattcttt caaagcacca ttgctcatta tagt 34 

<210> 467 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 467 

ttgctgccat agcaaagcct acagc 25 

<210> 468 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 468 

tgtgcttttt ttgctgccat agcaaagc 28 

<210> 469 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 469 

tgcttcaaaa cgcattttta cattttcgtt aaag 34 

<210> 470 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 470 

tcctccttgt gcctcaaaac gcattttta 29 

<210> 471 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 471 

tcaaagaacc cgcacctaat tcatcattta 30 
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<210> 472 
<211> 35 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 472 

tcaactggtt caaaaacatt aagttgtaat tgtcc 

<210> 473 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 473 

tacaactggt tcaaaaacat taagctgtaa ttgtc 

<210> 474 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 474 i 

ttcattttct ggtccaaagt aagcagtatc 

<210> 475 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 475 

tcccgaacaa tgagttgtat caactatttt tac 

<210> 476 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 476 

tgcctaacaa atcccgtctg agttc 

<210> 477 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 



<400> 477 
tgtcatcaag 



<210> 478 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 478 

ccacttttaa taaggtttgt age 

<210> 479 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 479 

tgttgaccat gcttcttag 

<210> 480 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 480 

cttctacatt tttagecate ac 

<210> 481 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 481 
eggcttcaag acccc 

<210> 482 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 482 

tgttaacggc ttcaagaccc 
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<210> 483 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 483 

ttgttaacgg cttcaagacc c 

<210> 484 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 484 

accactttta ataaggtttg tagctaac 

<210> 485 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer 
<400> 485 

cgcggtcggc tcgttgatga 

<210> 486 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 486 

tcacctacag ctttaaagcc agcaaaatg 

<210> 487 
<211> 33 
<212> DNA 



20 



29 



<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 487 

tcttctgtaa agggtggttt attattcatc cca 33 

<210> 488 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
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<223> Primer 
<400> 488 

tagccttggc aacatcagca aaact 

<210> 489 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 489 

ttggcgacgg tatacccata gctttata 

<210> 490 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 490 

tgaacatttg cgacggtata cccat 

<210> 491 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 491 

tgtgaacatt tgcgacggta tacccat 

<210> 492 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 492 

tggtgggtat cttagcaatc attctaatag c 

<210> 493 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
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tgcgatggta ggtatcttag caatcattct 

<210> 494 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 494 

caatctgctg acggatctga gc 



<210> 495 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



ttcaggtcca tcgggttcat gcc 

<210> 496 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 496 

ccgcggtcga attgcatgcc ttc 

<210> 497 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 497 

tagccgcggt cgaattgcat 

<210> 498 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 498 

tcgaaccgaa gttaccctga ccat 
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<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



tgccagctta gtcatacgga cttc 

<210> 500 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<4O0> 500 

tattgcggat caccatgatg atattcttgc 

<210> 501 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Primer 
<4O0> 501 

tcgttgagat ggtttttacc ttcgttg 

<210> 502 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Primer 
<4O0> 502 

tttgtgaaac agcgaacatt ttcttggta 

<210> 503 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Primer 
<400> 503 

tcacgcgcat catcaccagt ca 

<210> 504 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<400> 504 

tcctgcaata tctaatgcac tcttacg 

<210> 505 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 505 

acctgcaata tctaatgcac tcttacg 

<210> 506 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 506 

ctttcgcttt ctcgaactca accat 

<210> 507 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 507 

tagcccagct gtttgagcaa ct 

<210> 508 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 508 

tccctaatag tagaaataac tgcatcagta gc 

<210> 509 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 509 

tccctaatag tagaaataac tgcatcagta gc 
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<210> 510 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 510 

taggattttt ccacggcggc ate 

<210> 511 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 511 

tagectttte teeggegtag atct 



<210> 512 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 512 

catgatggtc acaacegg 

<210> 513 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 513 

tcggcatcac gccgtcgtc 

<210> 514 
<211> 29 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 514 

tgetgettte gcatggttaa ttgcttcaa 

<210> 515 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 
<400> 515 

ttgctgcttt cgcatggtta attgcttcaa 

<210> 516 
<211> 21 
<212> DNA 



30 



<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 516 

aacttcgcct tcggtcatgt t 21 

<210> 517 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 517 

ttgcgttgca gattatcttt accaa 25 

<210> 518 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 518 

tgttaagtgt gttgcggctg tctttatt 28 

<210> 519 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 519 

tcacgcgacg agtgccatcc attg 24 

<210> 520 
<211> 25 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer 
<400> 520 

tgacccaaag ctgaaagctt tactg 25 
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<210> 521 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 521 

ttttccagcc atgcagcgac 

<210> 522 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 522 

tccttctgat gcctgatgga ccaggag 

<210> 523 

<211> 19 

<212> DNA 

<213> Artificial Sequence 



<223> Primer 
<400> 523 

tgtcactccc gacacgcca 

<210> 524 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 524 

taaacgtccg ataccaatgg ttcgctc 

<210> 525 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 525 

tcaacaacac ctccttattc ccactc 

<210> 526 

<211> 17 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 
<400=> 526 

gaatatcaat ttgtagc 17 



<210> 527 
<211:> 31 
<212> DNA 

<213> Artificial Sequence 



<400> 527 

agataaagaa tcacgaatat caatttgtag c 

<210> 528 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 523 

aggafcagatt tatttcttgt teg 

<210> 529 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 529 

tcttccaagg atagatttat ttcttgttcg 

<210> 530 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 530 

ttcttccaag gatagattta tttcttgttc g 

<210> 531 

<211> 18 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer 
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tcttgacagc atccgttg 

<210> 532 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<400> 532 

cagataaaga atcgctccag 

<210> 533 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 533 

tctcatcccg atattaccgc catga 

<210> 534 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 534 

tggcaacagc tcaacacctt tgg 

<210> 535 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 535 

tgatcctgaa tgtttatatc tttaacgcct 

<210> 536 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 536 

tcccaatcta acttccacat accatct 

<210> 537 
<211> 27 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 537 

tggatagacg tcatatgaag gtgtgct 

<210> 538 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 538 

tattcttcgt tactcatgcc ataca 

<210> 539 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 539 
tactcatgcc a 

<210> 540 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 540 
tattcttcgt t 

<210> 541 
<211> 29 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 541 

taaccacccc aagatttatc tttttg-cca 

<210> 542 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 542 

tgtgatatgg aggtgtagaa ggtgtta 

<210> 543 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 543 

gagctgcgcc aacgaataaa tcgtc 

<210> 544 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 544 

tacgtcgcct ttaacttggt tatattcagc 

<210> 545 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 545 

tgccgtaaca tagaagttac cgttgatt 

<210> 546 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 546 

tcgggcgtag tttttagtaa ttaaatcaga agt 



<210> 547 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 547 

tcgtcgtatt tatagtgacc agcaccta 
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<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 548 

tcaacaccag cgttacctaa agtacctt 

<210> 549 
<211> 23 
<212> DNA 

<213> Artificial Sequence 



<400> 549 

tttaagcgcc agaaagcacc aac 

<210> 550 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 550 

tcgtttaagc gccagaaagc accaa 

<210> 551 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



taagccagca agagctgtat agttcca 

<210> 552 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 552 

tacaggagca gcaggcttca ag 

<210> 553 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Primer 
<400> 553 

tagcagcaaa agttatcaca cctgcagt 

<210> 554 
<211> 31 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 554 

tggttgtagt tcctgtagtt gttgcattaa ■ 

<210> 555 
<211> 25 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer 
<400> 555 

tcctgcagct ctacctgctc catta 

<210> 556 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 556 

ccctgtagta gaagaggtaa ccac 

<210> 557 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 557 

cctgtagtag aagaggtaac 

<210> 558 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 558 

tgattatcag cggaagtag 
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<210> 559 
<211> 18 
<212> DNA 

<213> Artificial Sequence 



<400> 559 

ccgtgctcca tttttcag 

<210> 560 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<400> 560 

tcggataagc tgccacaagg 

<210> 561 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 561 

tcggataagc tgccacaagg 

<210> 562 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 562 

ttcccctgac cttcgattaa aggatagc 

<210> 563 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 563 

ggtataacgc atcgcagcaa aagattta 

<210> 564 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 
<400> 564 

ttcggtataa cgcatcgcag ca 

<210> 565 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 565 

tcgctcagca ataattcact ataagccga 

<210> 566 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



22 



29 



<220> 

<223> Primer 
<400> 566 

taatgcgata ctggcctgca agtc 2 4 

<210> 567 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 567 

tgtaaattcc gcaaagactt tggcattag 29 

<210> 568 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 568 

tggtctgagt acctcctttg c 21 

<210> 569 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 569 

tattggaaat accggcagca tctc 24 
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<210> 570 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 570 

ttcaagtgct tgctcaccat tgtc 

<210> 571 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 571 

tggctcataa gacgcgcttg taga 

<210> 572 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Primer 
<400> 572 

tcgtttcacc ctgtcatgcc g 

<210> 573 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 573 

tccgataagc cggattctgt gc 

<210> 574 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 574 

tgccgataag ccggattctg tgc 

<210> 575 
<211> 27 
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<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 575 

tctcttaccc caccctttca cccttac 

<210> 576 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 576 

tgcctcgtgc aacccacccg 

<210> 577 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 577 

tgcctcgcgc aacctacccg 

<210> 578 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 578 

gtaagccatg ttttgttcca tc 

<210> 579 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 579 

tttacctcgc ctttccaccc ttacc 

<210> 580 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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tgctcttacc tcaccgttcc acccttacc 

<210> 581 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 581 

ataagccggg ttctgtcg 

<210> 582 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 582 

tctatagagt ccggactttc ctcgtga 

<210> 583 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 583 

tcaagcgatc tacccgcatt acaa 

<210> 584 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 584 

ataagccatg ttctgttcca tc 

<210> 585 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 585 

tgactttcct cccccttatc agtctcc 

<210> 586 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 
<400> 586 

ccaagtgctg gtttacccca tggagta 

<210> 587 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 587 

gtgctggttt accccatgga gt 

<210> 588 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 588 

tgtgctggtt taccccatgg agt 

<210> 589 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 589 

tgtgctggtt taccccatgg ag 

<210> 590 

<211> 26 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer 
<400> 590 

tccaagtgct ggtttacccc atggag 2 6 



<210> 591 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 591 

tccaagtgct ggtttacccc atgg 

<210> 592 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 592 

ttccaagtgc tggtttaccc catgg 

<210> 593 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 593 

tgttttgtat ccaagtgctg gtttacccc 

<210> 594 
<211> 18 
<212> DNA 



25 



<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 594 

ttcgctctcg gcctggcc 18 

<210> 595 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 595 

tcgtcgcgga cttcgaagcc 20 

<21C> 596 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 596 

gctggattcg cctttgctac g 21 
<210> 597 
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<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 597 

tgctggattc gcctttgcta eg 22 

<210> 598 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 598 

ttgacgttgc atgttcgagc ccat 24 

<210> 599 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 599 

egtataaget gcaccataag cttgtaatgc 30 

<210> 600 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 600 

tttcttgaag agtatgagct getcegtaag 30 

<210> 601 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 601 

cgacttgacg gttaacattt cctg 24 

<210> 602 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 602 

gtccgacttg acggtcaaca tttcctg 

<210> 603 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 603 

tgtccgactt gacggtcaac atttcctg 

<210> 604 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 604 

tgtccgactt gacggttagc atttcctg 

<210> 605 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 605 

tgtccgactt gacggtcagc atttcctg 

<210> 606 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 606 

tccagcaggt tctgacggaa acg 

<210> 607 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 607 

ttaccgagca ggttctgacg gaaacg 
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<211> 24 
<212> DNA 

<213> Artificial Sequence 



<400> 608 

cgaacggcca gagtagtcaa cacg 

<210> 609 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 609 

cgaacggcct gagtagtcaa cacg 

<210> 610 
<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 610 

tcaagcgcca tctctttcgg taatccacat 

<210> 611 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 611 

tcaagcgcca tttcttttgg taaaccacat 

<210> 612 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 612 

attcaagagc catttctttt ggtaaaccac 

<210> 613 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<400> 613 

gttcaaatgc ctggataccc a 

<210> 614 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 614 

gagcatcagc gtgcgtgct 

<210> 615 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 615 

tgagcatcag cgtgcgtgct 

<210> 616 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 616 

acgcgggcat gcagagatgc c 

<210> 617 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 617 

ggcgc-ttgta cttaccgcac 

<210> 618 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 618 

ttggccatca gaccacgcat ac 
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<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 619 

ttggccatca ggccacgcat ac 

<210> 620 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 620 

acgccatcag gccacgcat 

<210> 621 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 621 

tacgccatca ggccacgcat 

<210> 622 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 622 

ttacgccatc aggccacgca 



<210> 623 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 623 

cgcaccatgc gtagagatga agtac 

<210> 624 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Primer 
<400> 624 

cgcaccgtgg gttgagatga agtac 25 

<210> 625 
<211> 26 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer 
<400> 625 

tcgcaccgtg ggttgagatg aagtac 

<210> 626 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 626 

tgctagacct ttacgtgcac cgtg 

<210> 627 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 627 

tactagacga cgggtcaggt aacc 

<210> 628 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 628 

gtttttcgtt gcgtacgatg atgtc 

<210> 629 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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acgtttttcg ttttgaacga taatgct 

<210> 630 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 630 

gaccccaacc tggccttttg tcgttga 

<210> 631 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 631 

tgaccccaac ctggcctttt gtcgttga 

<210> 632 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 632 

aaactatttt tttagctata ctcgaacac 

<210> 633 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 633 

taaactattt ttttagctat actcgaacac 

<210> 634 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 634 

ggataattgg tcgtaacaag ggatagtgag 

<210> 635 
<211> 31 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 635 

tggataattg gtcgtaacaa gggatagtga g 31 

<210> 636 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 636 

atatgattat cattgaactg cggccg 26 

<210> 637 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 637 

tatatgatta tcattgaact gcggccg 

<210> 638 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 638 

gcgtgacgac cttcttgaat tgtaatca 

<210> 639 
<211> 29 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 639 

tgcgtgacga ccttcttgaa ttgtaatca 

<210> 640 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 640 

ttggacctgt aatcagctga atactgg 

<210> 641 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Primer 
<400> 641 

tttggacctg taatcagctg aatactgg 

<210> 642 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 642 

attgcccaga aatcaaatca tc 

<210> 643 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 643 

tattgcccag aaatcaaatc ate 

<210> 644 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 644 

tgtggccgat ttcaccacct gctcct 

<210> 645 
<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 645 

ttgtggccga tttcaccacc tgetect 

<210> 646 
<211> 23 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 646 

tctgggtgac ctggtgtttt aga 23 

<210> 647 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 647 

ttctgggtga cctggtgttt taga 24 

<210> 648 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 648 

agctgctaga tgagcttctg ccatggcc 28 



<210> 649 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 649 

tagctgctag atgagcttct gccatggcc 

<210> 650 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 650 

ccataaggtc accgtcacca ttcaaagc 

<210> 651 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 651 

tccataaggt caccgtcacc attcaaagc 

<210> 652 
<211> 23 
<212> DNA 



29 



<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 652 

ggaatttacc agcgatagac acc 23 

<210> 653 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 653 

tggaatttac cagcgataga cacc 24 

<210> 654 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 654 

tgccactttg acaactcctg ttgctg 2 6 



<210> 655 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 655 

ttgccacttt gacaactcct gttgctg 27 

<210> 656 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 656 

aatcgacgac catcttggaa agatttctc 29 
<210> 657 
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<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 657 

taatcgacga ccatcttgga aagatttctc 

<210> 658 
<211> 25 
<212> DNA 



30 



<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 658 

tcgacgacca tcttggaaag atttc 25 

<210> 659 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 659 

ccagcagtta ctgtcccctc atctttg 27 

<210> 660 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 660 

tccagcagtt actgtcccct catctttg 28 

<210> 661 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 661 

gctgctttga tggctgaatc cccttc 26 

<210> 662 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<400> 662 

tgctgctttg atggctgaat ccccttc 

<210> 663 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 663 

gggtctacac ctgcacttgc ataac 

<210> 664 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



tgggtctaca cctgcacttg cataac 

<210> 665 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 665 

atcccctgct tctgctgcc 

<210> 666 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 666 

tatcccctgc ttctgctgcc 

<210> 667 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 667 

taccttttcc acaacagaat cage 
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<210> 668 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<22C> 

<223> Primer 
<400> 668 

ccaacctttt ccacaacaga atcagc 

<210> 669 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 669 

tccaaccttt tccacaacag aatcagc 

<210> 670 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 670 

cccatttttt cacgcatgct gaaaatatc 

<210> 671 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 671 

tcccattttt tcacgcatgc tgaaaatatc 

<210> 672 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 672 

gattggcgat aaagtgatat tttctaaaa 

<210> 673 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 
<400> 673 

tgattggcga taaagtgata ttttctaaaa 

<210> 674 
<211> 26 
<212> DNA 



30 



<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 674 

gcccaccaga aagactagca ggataa 26 

<210> 675 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 675 

tgcccaccag aaagactagc aggataa 27 

<210> 67 6 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 67 6 

cctacccaac gttcaccaag ggcag 25 

<210> 677 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 67 7 

tcctacccaa cgttcaccaa gggcag 26 

<210> 67 8 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 678 

catgacagcc aagacctcac ccacc 25 
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<210> 679 
<211> 26 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 679 

tcatgacagc caagacctca cccacc 

<210> 680 
<211> 15 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 680 
tgtgctttga atgct 

<210> 681 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 681 

tcatttgtgc tttgaatgct 

<210> 682 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 682 

tcataactag catttgtgct ttgaatgct 

<210> 683 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 683 

ttgcacgtct gtttcagttg caaattc 



<210> 684 
<211> 27 
<212> DNA 

<213> Artificial Sequence 



126 



WO 2005/098047 



PCT/US2005/005356 



<220> 

<223> Primer 
<400> 684 

ttgcacgtct gtttcagttg caaattc 27 

<210> 685 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 685 

tctgtttcag ttgcaaattc 20 

<210> 686 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 686 

tgcacgtctg tttcagttgc aaattc 26 



<210> 687 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 687 

ttgcacgtct gtttcagttg caaa-ttc 

<210> 688 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 688 

tttcacagca tgcacgtctg tttcagttgc 

<210> 689 
<211> 26 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
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ttgtgattgt tttgcagctg attgtg 

<210> 690 
<211> 13 
<212> DNA 

<213> Artificial Sequence 



<400> 690 
tgcagctgat tgt 

<210> 691 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 691 

tgattgtttt gcagctgatt gt 

<210> 692 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 692 

ttcaaaacct tgctctcgcc aaacaa 

<210> 693 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 693 

tacatcgttt cgcccaagat caatca 

<210> 694 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 694 

tcctcttttc acaggctcta cttcatc 

<210> 695 
<211> 30 
<212> DNA 
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<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 695 

tatttgggtt tcattccact cagattctgg 

<210> 696 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 696 

tgcgcgagct tttatttggg tttc 

<210> 697 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 697 

ttcaaaatgc ggaggcgtat gtg 

<210> 598 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 698 

tgcccag-gta caacctgcat 

<210> 6 99 
<211> 3 0 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 6 99 

tccaggcatt accatttcta ctccttctgg 

<210> 7 00 

<211> 2 5 

<212> DIM A 

<213> Axtificial Sequence 
<220> 

<223> Primer 
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<400> 7 00 

ggcatcacca tttccttgtc cttcg 

<210> 7 01 
<211> 2 6 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 7 01 

tggcatcacc atttccttgt ccttcg 

<210> 702 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 702 

gttgtcacca ggcattacca tttc 

<210> 703 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 703 

gttgtcgcca ggcataacca tttc 

<210> 704 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 704 

gccgtccatt tgagcagcac c 

<210> 705 
<211> 23. 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 705 

gccgtccatc tgagcagcac c 

<210> 70 6 
<211> 26 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 706 

tatagcacca tccatctgag cggcac 26 

<210> 707 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 707 

tatgtgctca cgagtttgcg gcat 

<210> 708 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 708 

tggatgtgct cacgagtctg tggcat 

<210> 709 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 709 

gcgctccacg tcttcacgc 

<210> 710 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<400> 710 

acgaactgga tgtcgccgtt 

<210> 711 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 711 

cggtacgaac tggatgtcgc cgtt 

<210> 712 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 712 

tcggtacgaa ctggatgtcg ccgtt 

<210> 713 
<211> 27 
<212> DNA 

<213> Artificial Sequence 



<400> 713 

ttcgcgcatc caggagaagt acatgtt 

<210> 714 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 714 

gcgttccaca gcttgttgca gaag 

<210> 715 
<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 715 

tcgcagttca tcagcacgaa gcg 

<210> 716 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 716 

tataacgcac atcgtcaggg tga 
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<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 717 

caagcggttt gcctcaaata gtca 24 

<210> 718 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 718 

tggcacgagc ctgacctgt 19 



<210> 719 
<211> 1542 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Escherichia coli 
<400> 719 

aaattgaaga gtttgatcat ggctcagatt 
gtcgaacggt aacaggaaga agcttgcttc 
tgtctgggaa actgcctgat ggagggggat 
aacgtcgcaa gaccaaagag ggggaccttc 
ggattagcta gtaggtgggg taacggctca 
ggatgaccag ccacactgga actgagacac 
ggaatattgc acaatgggcg caagcctgat 
tcgggttgta aagtactttc agcggggagg 
gacgttaccc gcagaagaag caccggctaa 
ggtgcaagcg ttaatcggaa ttactgggcg 
gatgtgaaat ccccgggctc aacctgggaa 
gtagaggggg gtagaattcc aggtgtagcg 
ggtggcgaag gcggccccct ggacgaagac 
aacaggatta gataccctgg tagtccacgc 
cttgaggcgt ggcttccgga gctaacgcgt 
aggttaaaac tcaaatgaat tgacgggggc 
tcgatgcaac gcgaagaacc ttacctggtc 
aatgtgcctt cgggaaccgt gagacaggtg 
aatgttgggt taagtcccgc aacgagcgca 
cgggaactca aaggagactg ccagtgataa 
atcatggccc ttacgaccag ggctacacac 
acctcgcgag agcaagcgga cctcataaag 
tcgactccat gaagtcggaa tcgctagtaa 
tcccgggcct tgtacacacc gcccgtcaca 
agcttaacct tcgggagggc gcttaccact 
caaggtaacc gtaggggaac ctgcggttgg 

<210> 720 
<211> 2904 



gaacgctggc ggcaggccta acacatgcaa 60 

tttgctgacg agtggcggac gggtgagtaa 120 

aactactgga aacggtagct aataccgcat 180 

gggcctcttg ccatcggatg tgcccagatg 240 

cctaggcgac gatccctagc tggtctgaga 300 

ggtccagact cctacgggag gcagcagtgg 360 

gcagccatgc cgcgtgtatg aagaaggcct 420 

aagggagtaa agttaatacc tttgctcatt 480 

ctccgtgcca gcagccgcgg taatacggag 540 

taaagcgcac gcaggcggtt tgttaagtca 600 

ctgcatctga tactggcaag cttgagtctc 660 

gtgaaatgcg tagagatctg gaggaatacc 720 

tgacgctcag gtgcgaaagc gtggggagca 780 

cgtaaacgat gtcgacttgg aggttgtgcc 840 

taagtccg-acc gcctggggag tacggccgca 900 

ccgcacaagc ggtggagcat gtggtttaat 960 

ttgacatcca cggaagtttt cagagatgag 1020 

ctgcatcjgct gtcgtcagct cgtgttgtga 1080 

acccttatcc tttgttgcca gcggtccggc 1140 

actggacjgaa ggtggggatg acgtcaagtc 1200 

gtgctacaat ggcgcataca aagagaagcg 1260 

tgcgtcgtag tccggattgg agtctgcaac 1320 

tcgtggatca gaatgccacg gtgaatacgt 1380 

ccatgggagt gggttgcaaa agaagtaggt 1440 

ttgtgattca tgactggggt gaagtcgtaa 1500 

atcacctcct ta 1542 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Escherichia coli 
<400> 720 

ggttaagcga ctaagcgtac acggtggatg ccctggcagt cagaggcgat gaaggacgtg 60 

ctaatctgcg ataagcgtcg gtaaggtgat atgaaccgtt ataaccggcgj atttccgaat 120 

ggggaaaccc agtgtgtttc gacacactat cattaactga atccataggt taatgaggcg 180 

aaccggggga actgaaacat ctaagtaccc cgaggaaaag aaatcaaccg- agattccccc 240 

agtagcggcg agcgaacggg gagcagccca gagcctgaat cagtgtgtgt gttagtggaa 300 

gcgtctggaa aggcgcgcga tacagggtga cagccccgta cacaaaaatg- cacatgctgt 3 60 

gagctcgatg agtagggcgg gacacgtggt atcctgtctg aatatggggcj gaccatcctc 420 

caaggctaaa tactcctgac tgaccgatag tgaaccagta ccgtgaggga aaggcgaaaa 480 

gaaccccggc gaggggagtg aaaaagaacc tgaaaccgtg tacgtacaag- cagtgggagc 540 

acgcttaggc gtgtgactgc gtaccttttg tataatgggt cagcgactta tattctgtag 600 

caaggttaac cgaatagggg agccgaaggg aaaccgagtc ttaactgggc gttaagttgc 660 

agggtataga cccgaaaccc ggtgatctag ccatgggcag gttgaaggtt gggtaacact 720 

aactggagga ccgaaccgac taatgttgaa aaattagcgg atgacttgtcj gctgggggtg 7 80 

aaaggccaat caaaccggga gatagctggt tctccccgaa agctatttagj gtagcgcctc 840 

gtgaattcat ctccgggggt agagcactgt ttcggcaagg gggtcatccc gacttaccaa 900 

cccgatgcaa actgcgaata ccggagaatg ttatcacggg agacacacgg- cgggtgctaa 960 

cgtccgtcgt gaagagggaa acaacccaga ccgccagcta aggtcccaaa gtcatggtta 1020 

agtgggaaac gatgtgggaa ggcccagaca gccaggatgt tggcttagaa gcagccatca 1080 

tttaaagaaa gcgtaatagc tcactggtcg agtcggcctg cgcggaagat gtaacggggc 114 0 

taaaccatgc accgaagctg cggcagcgac gcttatgcgt tgttgggtag gggagcgttc 1200 

tgtaagcctg cgaaggtgtg ctgtgaggca tgctggaggt atcagaagtg- cgaatgctga 1260 

cataagtaac gataaagcgg gtgaaaagcc cgctcgccgg aagaccaagg- gttcctgtcc 1320 

aacgttaatc ggggcagggt gagtcgaccc ctaaggcgag gccgaaaggc gtagtcgatg 138 0 

ggaaacaggt taatattcct gtacttggtg ttactgcgaa ggggggacgg agaaggctat 1440 

gttggccggg cgacggttgt cccggtttaa gcgtgtaggc tggttttcca ggcaaatccg 1500 

gaaaatcaag gctgaggcgt gatgacgagg cactacggtg ctgaagcaac aaatgccctg 1560 

cttccaggaa aagcctctaa gcatcaggta acatcaaatc gtaccccaaa ccgacacagg 1620 

tggtcaggta gagaatacca aggcgcttga gagaactcgg gtgaaggaac taggcaaaat 1680 

ggtgccgtaa cttcgggaga aggcacgctg atatgtaggt gaggtccctc gcggatggag 17 4 0 

ctgaaatcag tcgaagatac cagctggctg caactgttta ttaaaaacac agcactgtgc 1800 

aaacacgaaa gtggacgtat acggtgtgac gcctgcccgg tgccggaagg- ttaattgatg 1860 

gggttagcgc aagcgaagct cttgatcgaa gccccggtaa acggcggcccj taactataac 1920 

ggtcctaagg tagcgaaatt ccttgtcggg taagttccga cctgcacgaa tggcgtaatg 1980 

atggccaggc tgtctccacc cgagactcag tgaaattgaa ctcgctgtga agatgcagtg 2040 

tacccgcggc aagacggaaa gaccccgtga acctttacta tagcttgaca ctgaacattg 2100 

agccttgatg tgtaggatag gtgggaggct ttgaagtgtg gacgccagtc tgcatggagc 2160 

cgaccttgaa ataccaccct ttaatgtttg atgttctaac gttgacccgt aatccgggtt 2220 

gcggacagtg tctggtgggt agtttgactg gggcggtctc ctcctaaaga gtaacggagg 2280 

agcacgaagg ttggctaatc ctggtcggac atcaggaggt tagtgcaatg- gcataagcca 234 0 

gcttgactgc gagcgtgacg gcgcgagcag gtgcgaaagc aggtcatagt gatccggtgg 2400 

ttctgaatgg aagggccatc gctcaacgga taaaaggtac tccggggata acaggctgat 2460 

accgcccaag agttcatatc gacggcggtg tttggcacct cgatgtcggc tcatcacatc 2520 

ctggggctga agtaggtccc aagggtatgg ctgttcgcca tttaaagtgg- tacgcgagct 2580 

gggtttagaa cgtcgtgaga cagttcggtc cctatctgcc gtgggcgctg- gagaactgag 2 64 0 

gggggctgct cctagtacga gaggaccgga gtggacgcat cactggtgtt cgggttgtca 27 0 0 

tgccaatggc actgcccggt agctaaatgc ggaagagata agtgctgaaa gcatctaagc 2760 

acgaaacttg ccccgagatg agttctccct gaccctttaa gggtcctgaa ggaacgttga 2820 

agacgacgac gttgataggc cgggtgtgta agcgcagcga tgcgttgagc taaccggtac 288 0 
taatgaaccg tgaggcttaa cctt 2904 

<210> 721 
<211> 447 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Bacillus anthracis 



<400> 721 

atgtttggat cagatttata tattgcatta gtattaggag ttacactgag ccttattttt 60 

acagaaagaa caggtatttt acctgcaggt ttagttgtac ctggttattt agcactcgtt 120 

tttaatcagc ccgtatttat gttggttgtt ttatttatca gtattttaac atatgtaatc 180 

gttacgtatg gtgtttcaag attcatgatt ttatatggcc gtagaaaatt fcgcggcaacg 240 

ctaattacag gtatttgttt aaaactttta tttgattatt gttatcctgt tatgccattt 300 

gagatttttg aattccgtgg tattggagtt attgttccag gattaattgc aaatacaatt 360 

caaagacaag ggttaccatt aacaattgga actacaattt tgttaagtgg tgcaacattt 420 

gcaatcatga atatttatta cttattt 447 

<210> 722 
<211> 2339 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bacillus anthracis 



<400> 722 

atgactagaa ataaatttat acctaataag 
tttgctatat cctcctcaca ggctatagaa 
agtgatatta aaagaaacca taaaactgaa 
gacagtatta ataacttagt taaaacagaa 
cagacacaag acttattaaa aaagatacct 
ggaggagaaa tctattttac agatatagat 
agtgaagaag agaaaaatag tatgaatagt 
tttgtatttg aaaagaaaag ggaaacacct 
attaatagtg aacaaagtaa agaagtatat 
attataagta aggataaatc tctagatcca 
gatgatagtg atagtagcga ccttttattt 
aataataaaa gtatagatat aaattttata 
ttttctttag cgttttctta ttattttgca 
gcccccgaca tgtttgagta tatgaataag 
gaaagtttga agaaagaagg tgtggaaaaa 
gcacttaaag cttcaggttt agtaccagaa 
gaattaaata catatattct ttttaggcct 
agtggtgtgg ctacaaaggg attgaatgtt 
gctggataca taccatttga tcaagattta 
gagaaaggaa atttagaaaa taaaaaatca 
ataccattaa agttagacca tttaagaata 
aagggtaaaa aagaaattga taatggtaaa 
gtatatgaat ttagaattag cgatgaaaac 
aaaattactg ttttagggga aaaattcaat 
gtagaagggg tcttgaagcc gttaacagct 
ttaacagaaa taaaaaaaca aataccacaa 
aattcattag aaaagcaaaa aggtgttact 
aaaccggatt caactaaggg aactttatca 
aatgaagcag tcaaatatac aggatataca 
caagataatg aagagtttcc tgaaaaagat 
gaatttatat taactaaaaa ttgggagatg 
ggaaaagatt atttatatta ttttaaccgt 
gcttatattg agtggactga tccgattaca 
gcagagttta taaaaaactt atccagtatc 
gatagtggcg acaaagacga atttgcaaaa 
ttgtcagact attacaattc agcaaatcat 
tcaatatttc gtggaatcca agcctataat 
atagcaccag aatacaaaaa ttattttcaa 



tttagtatta tatccttttc agtattacta 60 

gtaaatgcta tgaatgaaca ttacactgag 120 

aaaaataaaa ctgaaaaaga aaaatttaaa 180 

tttaccaatg aaactttaga fcaaaatacag 240 

aaggatgtac ttgaaattta tagtgaatta 300 

ttagtagaac ataaggagtt acaagattta 3 60 

agaggtgaaa aagttccgtt tgcatcccgt 420 

aaattaatta taaatatcaa agattatgca 4 80 

tatgaaattg gaaaggggat ttctcttgat 540 

gagtttttaa atttaattaa cjagtttaagc 600 

agtcaaaaat ttaaagagaa gjctagaattg 660 

aaagaaaatt taactgaatt tcagcatgcg 7 20 

cctgaccata gaacggtatt agagttatat 7 80 

ttagaaaaag ggggatttga gjaaaataagt 8 40 

gataggattg atgtgctgaa aggagaaaaa 900 

catgcagatg cttttaaaaa aattgctaga 960 

gttaataagt tagctacaaa ccttattaaa 1020 

catggaaaga gttcggattg gjggccctgta 1080 

tctaagaagc atggtcaaca attagctgtc 1140 

attacagagc atgaaggtga aataggtaaa 1200 

gaagagttaa aggaaaatgg cjataattttg 1260 

aaatattatt tgttagaatc gjaataatcag 1320 

aacgaagtac aatacaagac aaaagaaggt 1380 

tggagaaata tagaagtgat g-gctaaaaat 1440 

gactatgatt tatttgcact tgccccaagt 1500 

aaagaatggg ataaagtagt taacacccca 1560 

aatttattga ttaaatatgg aattgagagg 162 0 

aattggcaaa aacaaatgct tgatcgtttg 1680 

gggggggatg tggttaacca tggcacagag 1740 

aacgaaattt ttataattaa tccagaaggt 1800 

acaggtagat ttatagaaaa a_aacattacg 1860 

tcttataata aaatagctcc tggtaataaa 1920 

aaagccaaaa taaataccat ccctacgtca 1980 

agaagatctt caaatgtagg a.gtttataaa 2040 

aaagaaagcg tgaaaaaaat tgcaggatat 2100 

attttttctc aggaaaaaaa g-cgtaaaata 2160 

gaaattgaaa atgttctaaa a_tctaaacaa 2220 

tatttaaagg ,aaaggattac caatcaagtt 2280 
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caattgcttc taacacatca aaaatctaat attgaattta aattattgta taaacaatt 2339 

<210> 723 
<211> 1917 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Escherichia coli 
<400> 723 

atgggtaaaa taattggtat cgacctgggt actaccaact cttgtgtagc gattatggat 60 
ggcaccactc ctcgcgtgct ggagaacgcc gaaggcgatc gcaccacgcc ttctatcatt 120 
gcctataccc aggatggtga aactctagtt ggtcagccgg ctaaacgtca ggcagtgacg ISO 
aacccgcaaa acactctgtt tgcgattaaa cgcctgattg gtcgccgctt ccaggacgaa 24 0 
gaagtacagc gtgatgtttc catcatgccg ttcaaaatta ttgctgctga taacggcgac 3O0 
gcatgggtcg aagttaaagg ccagaaaatg gcaccgccgc agatttctgc tgaagtgctg 3S0 
aaaaaaatga agaaaaccgc tgaagattac ctgggtgaac cggtaactga agctgttatc 42 0 
accgtaccgg catactttaa cgatgctcag cgtcaggcaa ccaaagacgc aggccgtatc 4S0 
gctggtctgg aagtaaaacg tatcatcaac gaaccgaccg cagctgcgct ggcttacggt 54 0 
ctggacaaag gcactggcaa ccgtactatc gcggtttatg acctgggtgg tggtactttc 6O0 
gatatttcta ttatcgaaat cgacgaagtt gacggcgaaa aaaccttcga agttctggca 6S0 
accaacggtg atacccacct ggggggtgaa gacttcgaca gccgtctgat caactatctg 72 0 
gttgaagaat tcaagaaaga tcagggcatt gacctgcgca acgatccgct ggcaatgcag 7S0 
cgcctgaaag aagcggcaga aaaagcgaaa atcgaactgt cttccgctca gcagaccgac 84 0 
gttaacctgc catacatcac tgcagacgcg accggtccga aacacatgaa catcaaagtg 9O0 
actcgtgcga aactggaaag cctggttgaa gatctggtaa accgttccat tgagccgctg 9S0 
aaagttgcac tgcaggacgc tggcctgtcc gtatctgata tcgacgacgt tatcctcgtt 1O20 
ggtggtcaga ctcgtatgcc aatggttcag aagaaagttg ctgagttctt tggtaaagag 1O80 
ccgcgtaaag acgttaaccc ggacgaagct gtagcaatcg gtgctgctgt tcagggtggt 11.40 
gttctgactg gtgacgtaaa agacgtactg ctgctggacg ttaccccgct gtctctgggt 12 00 
atcgaaacca tgggcggtgt gatgacgacg ctgatcgcga aaaacaccac tatcccgacc 12 60 
aagcacagcc aggtgttctc taccgctgaa gacaaccagt ctgcggtaac catccatgtg 13 20 
ctgcagggtg aacgtaaacg tgcggctgat aacaaatctc tgggtcagtt caacctagat 13 80 
ggtatcaacc cggcaccgcg cggcatgccg cagatcgaag ttaccttcga tatcgatgct 14 40 
gacggtatcc tgcacgtttc cgcgaaagat aaaaacagcg gtaaagagca gaagatcacc 15 00 
atcaaggctt cttctggtct gaacgaagat gaaatccaga aaatggtacg cgacgcagaa 15 60 
gctaacgccg aagctgaccg taagtttgaa gagctggtac agactcgcaa ccagggcgac 15 20 
catctgctgc acagcacccg taagcaggtt gaagaagcag gcgacaaact gccggctgac 15 80 
gacaaaactg ctatcgagtc tgcgctgact gcactggaaa ctgctctgaa aggtgaagac 17 4 0 
aaagccgcta tcgaagcgaa aatgcaggaa ctggcacagg tttcccagaa actgatggaa 18 00 
atcgcccagc agcaacatgc ccagcagcag actgccggtg ctgatgcttc tgcaaacaac 18 60 
gcgaaagatg acgatgttgt cgacgctgaa tttgaagaag tcaaagacaa aaaataa 1^17 

<210> 724 
<211> 1647 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Escherichia coli 
<400> 724 

atggcagcta aagacgtaaa attcggtaac gacgctcgtg tgaaaatgct 
aacgtactgg cagatgcagt gaaagttacc ctcggtccaa aaggccgtaa 
gataaatctt tcggtgcacc gaccatcacc aaagatggtg tttccgttgc 
gaactggaag acaagttcga aaatatgggt gcgcagatgg tgaaagaagt 
gcaaacgacg ctgcaggcga cggtaccacc actgcaaccg tactggctca 
actgaaggtc tgaaagctgt tgctgcgggc atgaacccga tggacctgaa 
gacaaagcgg ttaccgctgc agttgaagaa ctgaaagcgc tgtccgtacc 
tctaaagcga ttgctcaggt tggtaccatc tccgctaact ccgacgaaac 



gcgcggcgta 60 
cgtagttctg 12 0 
tcgtgaaatc 18 0 
tgcctctaaa 24 0 
ggctatcatc 30 0 
acgtggtatc 36 0 
atgctctgac 42 0 
cgtaggtaaa 48 0 
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ctgatcgctg aagcgatgga caaagtcggt 
accggtctgc aggacgaact ggacgtggtt 
tctccttact tcatcaacaa gccggaaact 
ctgctggctg acaagaaaat ctccaacatc 
gccaaagcag gcaaaccgct gctgatcatc 
actctggttg ttaacaccat gcgtggcatc 
ttcggcgatc gtcgtaaagc tatgctgcag 
atctctgaag agatcggtat ggagctggaa 
aaacgtgttg tgatcaacaa agacaccacc 
gcaatccagg gccgtgttgc tcagatccgt 
gaccgtgaaa aactgcagga acgcgtagcg 
gtgggtgctg ctaccgaagt tgaaatgaaa 
cacgcgaccc gtgctgcggt agaagaaggc 



aaagaaggcg ttatcaccgt tgaagacggt 54 0 
gaaggtatgc agttcgaccg tggctacctg 600 
ggcgcagtag aactggaaag cccgttcatc 660 
cgcgaaatgc tgccggttct ggaagctgtt 720 
gctgaagatg tagaaggcga agcgctggca 780 
gtgaaagtcg ctgcggttaa agcaccgggc 840 
gatatcgcaa ccctgactgg cggtaccgtg 900 
aaagcaaccc tggaagacct gggtcaggct 960 
actatcatcg atggcgtggg tgaagaagct 1020 
cagcagattg aagaagcaac ttctgactac 1080 
aaactggcag gcggcgttgc agttatcaaa 1140 
gagaaaaaag cacgcgttga agatgccctg 1200 
gtggttgctg gtggtggtgt tgcgctgatc 12 60 



cgcgtagcgt ctaaactggc tgacctgcgt 
aaagttgcac tgcgtgcaat ggaagctccg 
gaaccgtctg ttgttgctaa caccgttaaa 
gcaaccgaag aatacggcaa catgatcgac 
cgttctgctc tgcagtacgc agcttctgtg 
gttaccgacc tgccgaaaaa cgatgcagct 
atgggtggca tgggcggcat gatgtaa 

<210> 725 
<211> 1935 
<212> DNA 

<213> Artificial Seguence 



ggtcagaacg aagaccagaa cgtgggtatc 1320 
ctgcgtcaga tcgtattgaa ctgcggcgaa 1380 
ggcggcgacg gcaactacgg ttacaacgca 1440 
atgggtatcc tggatccaac caaagtaact 1500 
gctggcctga tgatcaccac cgaatgcatg 1560 
gacttaggcg ctgctggcgg tatgggcggc 1620 
1647 



<220> 

<223> Escherichia coli 



<400> 725 

atggcgaaaa acctaatact ctggctggtc 
agctttgggc ccagcgagtc taatggccgt 
gtcaataacg accaggttcg tgaagcgcgt 
aaagatagta accgttatac cacttacatt 
ctgttgacca agaacgtcaa ggttgtcggt 
tctatcttca tctcctggtt cccgatgctg 
cgtcaaatgc agggcggcgg tggcaaaggt 
atgctgacgg aagatcagat caaaacgacc 
aaagaagaag ttgctgaact ggttgagtat 
ggcggtaaga tcccgaaagg cgtcttgatg 
ctggcgaaag cgattgcagg cgaagcgaaa 
ttcgtagaaa tgttcgtcgg tgtgggtgca 
aagaaagcgg caccgtgcat catctttatc 



attgccgttg tgctgatgtc agtattccag 60 
aaggtggatt actctacctt cctacaagag 12 0 
atcaacggac gtgaaatcaa cgttaccaag 180 
ccggttcagg atccgaaatt actggataac 24 0 
gaaccgcctg aagaaccaag cctgctggct 300 
ttgctgattg gtgtctggat cttcttcatg 360 
gccatgtcgt ttggtaagag caaagcgcgc 420 
tttgctgacg ttgcgggctg cgacgaagca 480 
ctgcgcgagc cgagccgctt ccagaaactc 54 0 
gtcggtcctc cgggtaccgg taaaacgctg 600 
gttccgttct ttactatctc cggttctgac 660 
tcccgtgttc gtgacatgtt cgaacaggcg 720 
gatgaaatcg acgccgtagg ccgccagcgt 780 



ggcgctggtc tgggcggtgg tcacgatgaa 
gagatggatg gcttcgaagg taacgaaggt 
gacgttctcg acccggccct gctgcgtcct 
ttgccagatg ttcgcggtcg tgagcagatc 
gcacccgata tcgacgcggc aatcattgcc 
ctggcgaacc tggtgaacga agcggcactg 
tcgatggttg agttcgagaa agcgaaagac 
atggtgatga cggaagcgca gaaagaatcg 
atcggtcgcc tggtgccgga acacgatccg 
cgtgcgctgg gtgtgacttt cttcttgcct 
aaactggaaa gccagatttc tacgctgtac 
gggccggaac atgtatctac cggtgcgtcc 
cgtaacatgg tgactcagtg gggcttctct 
gaagaaggtg aagtgttcct cggccgtagc 
actgcacgta tcatcgacca ggaagtgaaa 
cgtcagcttc tgaccgacaa tatggatatt 



cgtgaacaga ctctgaacca gatgctggtt 84 0 
atcatcgtta tcgccgcgac taaccgtccg 900 
ggccgtttcg accgtcaggt tgtggtcggc 960 
ctgaaagttc acatgcgtcg cgtaccattg 1020 
cgtggtactc ctggtttctc cggtgctgac 1080 
ttcgctgctc gtggcaacaa acgcgttgtg 1140 
aaaatcatga tgggtgcgga acgtcgctcc 1200 
acggcttacc acgaagcggg tcatgcgatt 1260 
gtgcacaaag tgacgattat cccacgcggt 1320 
gagggcgacg caatcagcgc cagccgtcag 1380 
ggtggtcgtc tggcagaaga gatcatctac 14 40 
aacgatatta aagttgcgac caacctggca 1500 
gagaaattgg gtccactgct gtacgcggaa 15 60 
gtagcgaaag cgaaacatat gtccgatgaa 1620 
gcactgattg agcgtaacta taatcgtgcg 1680 
ctgcatgcga tgaaagatgc tctcatgaaa 17 4 0 
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tatgagacta tcgacgcacc gcagattgat gacctgatgg cacgtcgcga tgtacgtccg 1800 

ccagcgggct gggaagaacc aggcgcttct aacaattctg gcgacaatgg tagtccaaag 1860 

gctcctcgtc cggttgatga accgcgtacg ccgaacccgg gtaacaccat gtcagagcag 1920 

ttaggcgaca agtaa 1935 

<210> 726 
<211> 2673 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Escherichia coli 



<400> 726 

atgacagatg taacgattaa 
gtacagcaat ttgctgatgc 
gagaaacaga ctttgattga 
ctgcaacgta aaacacgcag 
gtacaaatcg aagtccgcaa 
cgccttgcag cggaagagca 
gaagaatcgg ctaaacgcga 
aagcgtgaag ctgctgaaca 
caacaagacg atatgactaa 
gcagagctca agcgtaaagc 
cgcgttgctg aagaagcacg 
ccgactgaag attccagcga 
gacgaaagcg atcgtgaagt 
cgtccgaaga aaggcaacaa 
gcagtacgtg gcggtaaagg 
aagcctgctc aggccgttaa 
ctggcgaaca agatggcggt 
gcaatggcaa ccatcaacca 
atgggccata aagttatcct 
cgtgacacgg gtgctgcggc 
gaccacggta aaacctctct 
gcgggcggca ttacccagca 
accttcctgg acaccccggg 
gcaacggaca tcgtagtcct 
gaagcaatcc agcacgcgaa 
gataaaccag aagctgatcc 
ccggaagagt ggggcggtga 
atcgatgaac tgctggacgc 
cgtaaaggta tggcgagcgg 
gttgctaccg ttctggtacg 
ttcgaatacg gtcgtgttcg 
ggtccgtcca ttccggtgga 
gttaccgttg tacgtgacga 
ttccgcgaag ttaaactggc 
atgaccgaag gcgaagttca 
gtcgaagcga tctccgactc 
atcggttctg gcgtaggtgg 
gccatcctgg ttggctttaa 
gaaagcctgg atctgcgtta 
gcgatgagcg gtatgctgtc 
cgtgacgtgt tcaaatcgcc 
gtggttaaac gtcacaaccc 
gagctggagt ccctgcgccg 
tgtggtatcg gcgttaagaa 
gaaatcatcg agatccaacg 



aacgctggcc gcagagcgac 
aggtatccgg aagtctgctg 
ccacctgaat cagaaaaatt 
cacccttaac attcctggta 
gaaacgcacc tttgtgaaac 
agcgcagcgt gaagcggaag 
ggcgcaacaa aaagctgaac 
agcgaaacgt gaagctgcgg 
aaacgcccag gctgaaaaag 
tgaagaagaa gcgcgtcgta 
tcgtatggcg gaagaaaaca 
ttatcacgtc actacttctc 
cgaaggcggc cgtggccgtg 
acacgctgaa tcaaaagctg 
cggaaaacgt aaaggttctt 
ccgtgacgtt gtgatcggcg 
taaaggctct caggtcatca 
ggttatcgat caggaaaccg 
gcgtcgtgaa aacgagctgg 
tgaaccgcgc gcgccggttg 
gctggactac attcgttcaa 
cattggtgca taccacgttg 
gcacgccgcg tttacttcaa 
ggttgttgct gccgacgacg 
agcggcgcag gtaccggtgg 
ggatcgcgtt aagaacgaac 
aagccagttc gtacacgtat 
tatcctgctg caggcggaag 
tgcggttatc gaatccttcc 
tgaaggtact ctgcacaagg 
tgcgatgcgt aacgaactgg 
aatcctcggc ctgtccggcg 
gaagaaagcg cgtgaagttg 
gcgtcagcag aaatctaaac 
cgaagtgaat atcgtcctga 
cttgctgaaa ctgtctactg 
tatcaccgaa accgacgcca 
cgtacgtgct gatgcctctg 
ctactccgtc atctataacc 
tccggaactg aaacagcaga 
gaaatttggt gccatcgcag 
gatccgcgtt ctgcgtgaca 
cttcaaagat gacgttaacg 
ctacaacgac gtccgcactg 
taccattgct taa 



agacctccgt ggaacgcctg 60 
acgactctgt gtctgcacaa 120 
caggcccgga caaattgacg 180 
ccggtggaaa aagcaaatcg 240 
gcgatccgca agaggctgaa 300 
agcaagcccg tcgtgaggca 360 
gtgaggccgc agaacaagct 420 
aaaaagacaa agtgagcaat 480 
cccgccgtga gcaggaagct 540 
aactcgaaga agaagcacgt 600 
aatggactga taacgcggaa 660 
aacatgctcg ccaggcagaa 720 
gtcgtaacgc gaaagcagcg 780 
atcgtgaaga agcacgcgca 840 
cgctgcagca aggcttccag 900 
aaactatcac cgttggcgaa 960 
aagcgatgat gaaactgggc 1020 
cacagctggt tgctgaagag 1080 
aagaggcggt aatgagcgac 1140 
tgaccatcat gggtcacgtt 1200 
cgaaagtggc ctctggcgaa 1260 
aaactgaaaa cggcatgatc 1320 
tgcgtgctcg tggtgcgcag 1380 
gtgtgatgcc gcagaccatc 1440 
tggttgcagt gaacaagatc 1500 
tctcccagta cggcatcctg 1560 
ctgcgaaagc gggtaccggt 1620 
ttctggagct gaaagcggta 1680 
tcgataaagg tcgtggtccg 1740 
gcgatatcgt tctgtgtggc 1800 
gtcaggaagt gctggaagcg 18 60 
taccggctgc gggtgatgaa 1920 
cactctatcg tcagggtaaa 1980 
tcgagaacat gttcgccaac 2040 
aggcagacgt acagggttct 2100 
acgaagttaa agtgaagatc 2160 
ccctggctgc ggcgtccaac 2220 
cacgtaaagt gattgaagcg 2280 
tgattgacga agtgaaagcg 2340 
ttatcggtct ggcggaagtt 2400 
gctgtatggt taccgaaggt 2460 
acgtggttat ctacgaaggc 2520 
aagtccgtaa cggtatggaa 2580 
gcgatgtgat cgaagtattc 2640 
2673 
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<210> 727 
<211> 2480 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bacillus anthracis 
<400> 727 

atgaatataa aaaaagaatt tataaaagta 

actttgagtg gtcccgtctt tatccccctt 

ggtatgcacg taaaagagaa agagaaaaat 

cgaaataaaa cacaggaaga gcatttaaag 

gtaaaagggg aggaagctgt taaaaaagag 

tctgatgttt tagagatgta taaagcaatt 

attacaaaac atatatcttt agaagcatta 

tatgggaaag atgctttatt acatgaacat 

gtacttgtaa tccaatcttc ggaagattat 

tattatgaaa taggtaagat attatcaagg 

cagaaatttt tagatgtatt aaataccatt 

cttttattta ctaatcagct taaggaacat 

caaaatagca atgaggtaca agaagtattt 

cagcatcgtg atgttttaca gctttatgca 

aacgaacaag aaataaatct atccttggaa 

tatgaaaaat gggaaaagat aaaacagcac 

gaaggaagag gacttttaaa aaagctgcag 

attcattctt tatctcaaga agaaaaagag 

gattttttat ctactgagga aaaagagttt 

tctttatctg aagaagaaaa agagctttta 

ttatctgaaa aagaaaaaga gtttttaaaa 

attaatcaaa ggttgcaaga tacaggaggg 

gtaagaaagc agtataaaag ggatattcaa 

ggaagtacct tgtacaataa aatttatttg 

gcaaccctag gtgcggattt agttgattcc 

ttcaatgaat tcaaaaaaaa tttcaaatat 

ataaatgaaa ggcctgcatt agataatgag 

gatactcgag caggatattt agaaaatgga 

gaaataaagg atgtacaaat aattaagcaa 

aaagtagtgc caaagagtaa aatagataca 

caggaatgga ataaagcatt agggttacca 

cataatagat atgcatccaa tattgtagaa 

aataatattc aaagtgatct tataaaaaag 

agatttgttt ttaccgatat tactctccct 

gagatatatg agcaagttca ttcaaaaggg 

ctccatggac cttcaaaagg tgtagaatta 

tttggacatg ctgtggatga ttatgctgga 

gttacaaatt ctaaaaaatt cattgatatt 

tatgggagaa caaatgaagc ggaatttttt 

gaccatgctg aacgtttaaa agttcaaaaa 

gatcagatta agttcattat taactcataa 

aataataata ataataataa 

<210> 728 
<211> 2295 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bacillus anthracis 
<400> 728 



attagtatgt catgtttagt aacagcaatt 60 
gtacaggggg cgggcggtca tggtgatgta 12 0 
aaagatgaga ataagagaaa agatgaagaa 180 
gaaatcatga aacacattgt aaaaatagaa 24 0 
gcagcagaaa agctacttga gaaagtacca 300 
ggaggaaaga tatatattgt ggatggtgat 360 
tctgaagata agaaaaaaat aaaagacatt 420 
tatgtatatg caaaagaagg atatgaaccc 480 
gtagaaaata ctgaaaaggc actgaacgtt 54 0 
gatattttaa gtaaaattaa tcaaccatat 600 
aaaaatgcat ctgattcaga tggacaagat 660 
cccacagact tttctgtaga attcttggaa 720 
gcgaaagctt ttgcatatta tatcgagcca 780 
ccggaagctt ttaattacat ggataaattt 84 0 
gaacttaaag atcaacggat gctgtcaaga 90 0 
tatcaacact ggagcgattc tttatctgaa 960 
attcctattg agccaaagaa agatgacata 1020 
cttctaaaaa gaatacaaat tgatagtagt 1080 
ttaaaaaagc tacaaattga tattcgtgat 1140 
aatagaatac aggtggatag tagtaatcct 1200 
aagctgaaac ttgatattca accatatgat 1260 
ttaattgata gtccgtcaat taatcttgat 1320 
aatattgatg ctttattaca tcaatccatt 1380 
tatgaaaata tgaatatcaa taaccttaca 1440 
actgataata ctaaaattaa tagaggtatt 1500 
agtatttcta gtaactatat gattgttgat 1560 
cgtttgaaat ggagaatcca attatcacca 1620 
aagcttatat tacaaagaaa catcggtctg 1680 
tccgaaaaag aatatataag gattgatgcg 17 40 
aaaattcaag aagcacagtt aaatataaat 1800 
aaatatacaa agcttattac attcaacgtg 18 60 
agtgcttatt taatattgaa tgaatggaaa 1920 
gtaacaaatt acttagttga tggtaatgga 1980 
aatatagctg aacaatatac acatcaagat 2040 
ttatatgttc cagaatcccg ttctatatta 2100 
aggaatgata gtgagggttt tatacacgaa 2160 
tatctattag ataagaacca atctgattta 2220 
tttaaggaag aagggagtaa tttaacttcg 2280 
gcagaagcct ttaggttaat gcattctacg 2340 
aatgctccga aaactttcca atttattaac 2400 
gtaatgtatt aaaaattttc aaatggattt 24 60 
2480 
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atgaaaaaac gaaaagtgtt aataccatta 
acaggtaatt tagaggtgat tcaggcagaa 
tcagaatcaa gttcccaggg gttactagga 
cccatggtgg ttacctcttc tactacaggg 
aatattccat cggaaaacca atattttcaa 
aagaagagtg atgaatatac atttgctact 
gatgaccaag aagtgattaa taaagcttct 
agattatatc aaataaaaat tcaatatcaa 
ttcaagttgt actggaccga ttctcaaaat 
caattgccag aattaaaaca aaaatcttcg 
ggacctacgg ttccagaccg tgacaatgat 
tatacggttg atgtcaaaaa taaaagaact 
gaaaagaaag gattaaccaa atataaatca 
ccgtacagtg atttcgaaaa ggttacagga 
agacaccccc ttgtggcagc ttatccgatt 
tcaaaaaatg aggatcaatc cacacagaat 
aatacttcta caagtaggac acatactagt 
tcgttctttg atattggtgg gagtgtatct 
gtcgcaattg atcattcact atctctagca 
ttaaataccg ctgatacagc aagattaaat 
gctccaatct acaacgtgtt accaacgact 
gcgacaatta aagctaagga aaaccaatta 
ccttctaaaa acttggcgcc aatcgcatta 
attacaatga attacaatca atttcttgag 
acggatcaag tatatgggaa tatagcaaca 
gatacaggct cgaactggag tgaagtgtta 
atttttaatg gaaaagattt aaatctggta 
gatccattag aaacgactaa accggatatg 
ggatttaacg aaccgaatgg aaacttacaa 
tttaatttcg atcaacaaac atctcaaaat 
actaacatat atactgtatt agataaaatc 
agagataaac gttttcatta tgatagaaat 
gttaaggagg ctcatagaga agtaattaat 
gataaggata taagaaaaat attatcaggt 
cttaaagaag ttataaatga cagatatgat 
ggaaaaacat ttatagattt taaaaaatat 
cccaattata aggtaaatgt atatgctgtt 
gagaatgggg atactagtac caacgggatc 
tatgagatag gataa 



atggcattgt ctacgatatt agtttcaagc 60 
gttaaacagg agaaccggtt attaaatgaa 12 0 
tactatttta gtgatttgaa ttttcaagca 18 0 
gatttatcta ttcctagttc tgagttagaa 240 
tctgctattt ggtcaggatt tatcaaagtt 300 
tccgctgata atcatgtaac aatgtgggta 360 
aattctaaca aaatcagatt agaaaaagga 420 
cgagaaaatc ctactgaaaa aggattggat 480 
aaaaaagaag tgatttctag tgataactta 54 0 
aactcaagaa aaaagcgaag tacaagtgct 600 
ggaatccctg attcattaga ggtagaagga 660 
tttctttcac catggatttc taatattcat 720 
tctcctgaaa aatggagcac ggcttctgat 780 
cggattgata agaatgtatc accagaggca 840 
gtacatgtag atatggagaa tattattctc 900 
actgatagtc aaacgagaac aataagtaaa 960 
gaagtacatg gaaatgcaga agtgcatgcg 1020 
gcaggattta gtaattcgaa ttcaagtacg 1080 
ggggaaagaa cttgggctga aacaatgggt 1140 
gccaatatta gatatgtaaa tactgggacg 1200 
tcgttagtgt taggaaaaaa tcaaacactc 1260 
agtcaaatac ttgcacctaa taattattat 1320 
aatgcacaag acgatttcag ttctactcca 1380 
ttagaaaaaa cgaaacaatt aagattagat 1440 
tacaattttg aaaatggaag agtgagggtg 1500 
ccgcaaattc aagaaacaac tgcacgtatc 15 60 
gaaaggcgga tagcggcggt taatcctagt 1620 
acattaaaag aagcccttaa aatagcattt 1680 
tatcaaggga aagacataac cgaatttgat 1740 
atcaagaatc agttagcgga attaaacgca 1800 
aaattaaatg caaaaatgaa tattttaata 1860 
aacatagcag ttggggcgga tgagtcagta 1920 
tcgtcaacag agggattatt gttaaatatt 1980 
tatattgtag aaattgaaga tactgaaggg 2040 
atgttgaata tttctagttt acggcaagat 2100 
aatgataaat taccgttata tataagtaat 2160 
actaaagaaa acactattat taatcctagt 2220 
aagaaaattt taatcttttc taaaaaaggc 2280 
2295 



<21C> 729 
<211> 822 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Escherichia coli 



<400> 729 

atggcagttg ttaaatgtaa accgacatct 
aaccctgagc tgcacaaggg caaacctttt 
ggtggtcgta acaacaatgg ccgtatcacc 
gcttaccgta ttgttgactt caaacgcaac 
cttgagtacg atccgaaccg ttccgcgaac 
cgccgttaca tcctggcccc taaaggcctg 
gatgctgcaa tcaaaccagg taacaccctg 
gttcataacg tagaaatgaa accaggtaaa 
tacgttcaga tcgttgctcg tgatggtgct 
atgcgtaaag tagaagcaga ctgccgtgca 
atgctgcgcg ttctgggtaa agcaggtgct 
cgcggtaccg cgatgaaccc ggtagaccac 



ccgggtcgtc gccacgtagt taaagtggtt 60 
gctccgttgc tggaaaaaaa cagcaaatcc 120 
actcgtcata tcggtggtgg ccacaagcag 180 
aaagacggta tcccggcagt tgttgaacgt 240 
atcgcgctgg ttctgtacaa agacggtgaa 300 
aaagctggcg accagattca gtctggcgtt 3 60 
ccgatgcgca acatcccggt tggttctact 420 
ggcggtcagc tggcacgttc cgctggtact 480 
tatgtcaccc tgcgtctgcg ttctggtgaa 540 
actctgggcg aagttggcaa tgctgagcat 600 
gcacgctggc gtggtgttcg tccgaccgtt 6 60 
ccacatggtg gtggtgaagg tcgtaacttt 720 
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ggtaagcacc cggtaactcc gtggggcgtt cagaccaaag gtaagaagac ccgcagcaac 780 
aagcgtactg ataaattcat cgtacgtcgc cgtagcaaat aa 822 

<210> 730 
<211> 4029 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Escherichia coli 



<400> 730 

atggtttact cctataccga gaaaaaacgt 

gttctggatg taccttatct cctttctatc 

caagatcctg aagggcagta tggtctggaa 

agctacagcg gtaattccga gctgcaatac 

gacgtccagg aatgtcaaat ccgtggcgtg 

cgtctggtga tctatgagcg cgaagcgccg 



attcgtaagg attttggtaa acgtccacaa 60 
cagcttgact cgtttcagaa atttatcgag 120 
gctgctttcc gttccgtatt cccgattcag 180 
gtcagctacc gccttggcga accggtgttt 240 
acctattccg caccgctgcg cgttaaactg 300 
gaaggcaccg taaaagacat taaagaacaa 360 



gaagtctaca tgggcgaaat tccgctcatg acagacaacg gtacctttgt tatcaacggt 420 

actgagcgtg ttatcgtttc ccagctgcac cgtagtccgg gcgtcttctt tgactccgac 4B0 

aaaggtaaaa cccactcttc gggtaaagtg ctgtataacg cgcgtatcat cccttaccgt 540 

ggttcctggc tggacttcga attcgatccg aaggacaacc tgttcgtacg tatcgaccgt 600 

cgccgtaaac tgcctgcgac catcattctg cgcgccctga actacaccac agagcagatc 660 

ctcgacctgt tctttgaaaa agttatcttt gaaatccgtg ataacaagct gcagatggaa 720 

ctggtgccgg aacgcctgcg tggtgaaacc gcatcttttg acatcgaagc taacggtaaa 7 80 

gtgtacgtag aaaaaggccg ccgtatcact gcgcgccaca ttcgccagct ggaaaaagac 840 

gacgtcaaac tgatcgaagt cccggttgag tacatcgcag gtaaagtggt tgctaaagac 900 

tatattgatg agtctaccgg cgagctgatc tgcgcagcga acatggagct gagcctggat 960 

ctgctggcta agctgagcca gtctggtcac aagcgtatcg aaacgctgtt caccaacgat 1020 

ctggatcacg gcccatatat ctctgaaacc ttacgtgtcg acccaactaa cgaccgtctg 1080 

agcgcactgg tagaaatcta ccgcatgatg cgccctggcg agccgccgac tcgtgaagca 114 0 

gctgaaagcc tgttcgagaa cctgttcttc tccgaagacc gttatgactt gtctgcggtt 1200 

ggtcgtatga agttcaaccg ttctctgctg cgcgaagaaa tcgaaggttc cggtatcctg 1260 

agcaaagacg acatcattga tgttatgaaa aagctcatcg atatccgtaa cggtaaaggc 1320 

gaagtcgatg atatcgacca cctcggcaac cgtcgtatcc gttccgttgg cgaaatggcg 1380 

gaaaaccagt tccgcgttgg cctggtacgt gtagagcgtg cggtgaaaga gcgtctgtct 1440 

ctgggcgatc tggataccct gatgccacag gatatgatca acgccaagcc gatttccgca 1500 

gcagtgaaag agttcttcgg ttccagccag ctgtctcagt ttatggacca gaacaacccg 1560 

ctgtctgaga ttacgcacaa acgtcgtatc tccgcactcg gcccaggcgg tctgacccgt 1620 

gaacgtgcag gcttcgaagt tcgagacgta cacccgactc actacggtcg cgtatgtcca 1680 

atcgaaaccc ctgaaggtcc gaacatcggt ctgatcaact ctctgtccgt gtacgcacag 1740 

actaacgaat acggcttcct tgagactccg tatcgtaaag tgaccgacgg tgttgtaact 1800 

gacgaaattc actacctgtc tgctatcgaa gaaggcaact acgttatcgc ccaggcgaac 1860 

tccaacttgg atgaagaagg ccacttcgta gaagacctgg taacttgccg tagcaaaggc 1920 

gaatccagct tgttcagccg cgaccaggtt gactacatgg acgtatccac ccagcaggtg 1980 

gtatccgtcg gtgcgtccct gatcccgttc ctggaacacg atgacgccaa ccgtgcattg 2040 

atgggtgcga acatgcaacg tcaggccgtt ccgactctgc gcgctgataa gccgctggtt 2100 

ggtactggta tggaacgtgc tgttgccgtt gactccggtg taactgcggt agctaaacgt 2160 

ggtggtgtcg ttcagtacgt ggatgcttcc cgtatcgtta tcaaagttaa cgaagacgag 222 0 

atgtatccgg gtgaagcagg tatcgacatc tacaacctga ccaaatacac ccgttctaac 2280 

cagaacacct gtatcaacca gatgccgtgt gtgtctctgg gtgaaccggt tgaacgtggc 2340 

gacgtgctgg cagacggtcc gtccaccgac ctcggtgaac tggcgcttgg tcagaacatg 2400 

cgcgtagcgt tcatgccgtg gaatggttac aacttcgaag actccatcct cgtatccgag 2460 

cgtgttgttc aggaagaccg tttcaccacc atccacattc aggaactggc gtgtgtgtcc 2520 

cgtgacacca agctgggtcc ggaagagatc accgctgaca tcccgaacgt gggtgaagct 2580 

gcgctctcca aactggatga atccggtatc gtttacattg gtgcggaagt gaccggtggc 2640 

gacattctgg ttggtaaggt aacgccgaaa ggtgaaactc agctgacccc agaagaaaaa 2700 

ctgctgcgtg cgatcttcgg tgagaaagcc tctgacgtta aagactcttc tctgcgcgta 2760 
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ccaaacgg-tg tatccggtac ggttatcgac gttcaggtct ttactcgcga tggcgtagaa 2820 

aaagacaaac gtgcgctgga aatcgaagaa atgcagctca aacaggcgaa gaaagacctg 2880 

tctgaagaac tgcagatcct cgaagcgggt ctgttcagcc gtatccgtgc tgtgctggta 2940 

gccggtggcg ttgaagctga gaagctcgac aaactgccgc gcgatcgctg gctggagctg 3000 

ggcctgacag acgaagagaa acaaaatcag ctggaacagc tggctgagca gtatgacgaa 3060 

ctgaaacacg agttcgagaa gaaactcgaa gcgaaacgcc gcaaaatcac ccagggcgac 3120 

gatctggcac cgggcgtgct gaagattgtt aaggtatatc tggcggttaa acgccgtatc 3180 

cagcctgg-tg acaagatggc aggtcgtcac ggtaacaagg gtgtaatttc taagatcaac 3240 

ccgatcgaag atatgcctta cgatgaaaac ggtacgccgg tagacatcgt actgaacccg 3300 

ctgggcgtac cgtctcgtat gaacatcggt cagatcctcg aaacccacct gggtatggct 3360 

gcgaaagg-ta tcggcgacaa gatcaacgcc atgctgaaac agcagcaaga agtcgcgaaa 3420 

ctgcgcgaat tcatccagcg tgcgtacgat ctgggcgctg acgttcgtca gaaagttgac 3480 

ctgagtacct tcagcgatga agaagttatg cgtctggctg aaaacctgcg caaaggtatg 3540 

ccaatcgcaa cgccggtgtt cgacggtgcg aaagaagcag aaattaaaga gctgctgaaa 3600 

cttggcgacc tgccgacttc cggtcagatc cgcctgtacg atggtcgcac tggtgaacag 3660 

ttcgagcg-tc cggtaaccgt tggttacatg tacatgctga aactgaacca cctggtcgac 3720 

gacaagatgc acgcgcgttc caccggttct tacagcctgg ttactcagca gccgctgggt 3780 

ggtaaggcac agttcggtgg tcagcgtttc ggggagatgg aagtgtgggc gctggaagca 3840 

tacggcgcag catacaccct gcaggaaatg ctcaccgtta agtctgatga cgtgaacggt 3900 

cgtaccaaga tgtataaaaa catcgtggac ggcaaccatc agatggagcc gggcatgcca 3960 

gaatccttca acgtattgtt gaaagagatt cgttcgctgg gtatcaacat cgaactggaa 4020 
gacgagtaa 4029 

<210> 731 
<211> 4224 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Escherichia coli 
<400> 731 

gtgaaagatt tattaaagtt tctgaaagcg cagactaaaa ccgaagagtt tgatgcgatc 60 

aaaattgctc tggcttcgcc agacatgatc cgttcatggt ctttcggtga agttaaaaag 120 

ccggaaacca tcaactaccg tacgttcaaa ccagaacgtg acggcctttt ctgcgcccgt 180 

atctttgggc cggtaaaaga ttacgagtgc ctgtgcggta agtacaagcg cctgaaacac 240 

cgtggcgtca tctgtgagaa gtgcggcgtt gaagtgaccc agactaaagt acgccgtgag 300 

cgtatgggcc acatcgaact ggcttccccg actgcgcaca tctggttcct gaaatcgctg 360 

ccgtcccg-ta tcggtctgct gctcgatatg ccgctgcgcg atatcgaacg cgtactgtac 420 

tttgaatcct atgtggttat cgaaggcggt atgaccaacc tggaacgtca gcagatcctg 480 

actgaagagc agtatctgga cgcgctggaa gagttcggtg acgaattcga cgcgaagatg 540 

ggggcggaag caatccaggc tctgctgaag agcatggatc tggagcaaga gtgcgaacag 600 

ctgcgtgaag agctgaacga aaccaactcc gaaaccaagc gtaaaaagct gaccaagcgt 660 

atcaaactgc tggaagcgtt cgttcagtct ggtaacaaac cagagtggat gatcctgacc 720 

gttctgccgg tactgccgcc agatctgcgt ccgctggttc cgctggatgg tggtcgtttc 780 

gcgacttctg acctgaacga tctgtatcgt cgcgtcatta accgtaacaa ccgtctgaaa 840 

cgtctgctgg atctggctgc gccggacatc atcgtacgta acgaaaaacg tatgctgcag 900 

gaagcggtag acgccctgct ggataacggt cgtcgcggtc gtgcgatcac cggttctaac 960 

aagcgtcctc tgaaatcttt ggccgacatg atcaaaggta aacagggtcg tttccgtcag 1020 

aacctgctcg gtaagcgtgt tgactactcc ggtcgttctg taatcaccgt aggtccatac 1080 

ctgcgtctgc atcagtgcgg tctgccgaag aaaatggcac tggagctgtt caaaccgttc 1140 

atctacggca agctggaact gcgtggtctt gctaccacca ttaaagctgc gaagaaaatg 1200 

gttgagcgcg aagaagctgt cgtttgggat atcctggacg aagttatccg cgaacacccg 12 60 

gtactgctga accgtgcacc gactctgcac cgtctgggta tccaggcatt tgaaccggta 1320 

ctgatcgaag gtaaagctat ccagctgcac ccgctggttt gtgcggcata taacgccgac 1380 

ttcgatgg-tg accagatggc tgttcacgta ccgctgacgc tggaagccca gctggaagcg 1440 

cgtgcgctga tgatgtctac caacaacatc ctgtccccgg cgaacggcga accaatcatc 1500 

gttccgtc-tc aggacgttgt actgggtctg tactacatga cccgtgactg tgttaacgcc 1560 

aaaggcgaag gcatggtgct gactggcccg aaagaagcag aacgtctgta tcgctctggt 1620 

ctggcttc-tc tgcatgcgcg cgttaaagtg cgtatcaccg agtatgaaaa agatgctaac 1680 

ggtgaattag tagcgaaaac cagcctgaaa gacacgactg ttggccgtgc cattctgtgg 1740 
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atgattgtac cgaaaggtct gccttactcc atcgtcaacc aggcgctggg taaaaaagca 1800 
atctccaaaa tgctgaacac ctgctaccgc attctcggtc tgaaaccgac cgttattttt 1860 
gcggaccaga tcatgtacac cggcttcgcc tatgcagcgc gttctggtgc atctgttggt 1920 
atcgatgaca tggtcatccc ggagaagaaa cacgaaatca tctccgaggc agaagcagaa 1980 
gttgctgaaa ttcaggagca gttccagtct ggtctggtaa ctgcgggcga acgctacaac 2040 
aaagttatcg atatctgggc tgcggcgaac gatcgtgtat ccaaagcgat gatggataac 2100 
ctgcaaactg aaaccgtgat taaccgtgac ggtcaggaag agaagcaggt ttccttcaac 2160 
agcatctaca tgatggccga ctccggtgcg cgtggttctg cggcacagat tcgtcagctt 2220 
gctggtatgc gtggtctgat ggcgaagccg gatggctcca tcatcgaaac gccaatcacc 2280 
gcgaacttcc gtgaaggtct gaacgtactc cagtacttca tctccaccca cggtgctcgt 2340 
aaaggtctgg cggataccgc actgaaaact gcgaactccg gttacctgac tcgtcgtctg 2400 
gttgacgtgg cgcaggacct ggtggttacc gaagacgatt gtggtaccca tgaaggtatc 2460 
atgatgactc cggttatcga gggtggtgac gttaaagagc cgctgcgcga tcgcgtactg 2520 
ggtcgtgtaa ctgctgaaga cgttctgaag ccgggtactg ctgatatcct cgttccgcgc 2580 
aacacgctgc tgcacgaaca gtggtgtgac ctgctggaag agaactctgt cgacgcggtt 2640 
aaagtacgtt ctgttgtatc ttgtgacacc gactttggtg tatgtgcgca ctgctacggt 2700 
cgtgacctgg cgcgtggcca catcatcaac aagggtgaag caatcggtgt tatcgcggca 27 60 
cagtccatcg gtgaaccggg tacacagctg accatgcgta cgttccacat cggtggtgcg 2820 
gcatctcgtg cggctgctga atccagcatc caagtgaaaa acaaaggtag catcaagctc 2880 
agcaacgtga agtcggttgt gaactccagc ggtaaactgg ttatcacttc ccgtaatact 2940 
gaactgaaac tgatcgacga attcggtcgt actaaagaaa gctacaaagt accttacggt 3000 
gcggtactgg cgaaaggcga tggcgaacag gttgctggcg gcgaaaccgt tgcaaactgg 3060 
gacccgcaca ccatgccggt tatcaccgaa gtaagcggtt ttgtacgctt tactgacatg 3120 
atcgacggcc agaccattac gcgtcagacc gacgaactga ccggtctgtc ttcgctggtg 3180 
gttctggatt ccgcagaacg taccgcaggt ggtaaagatc tgcgtccggc actgaaaatc 3240 
gttgatgctc agggtaacga cgttctgatc ccaggtaccg atatgccagc gcagtacttc 3300 
ctgccgggta aagcgattgt tcagctggaa gatggcgtac agatcagctc tggtgacacc 3360 
ctggcgcgta ttccgcagga atccggcggt accaaggaca tcaccggtgg tctgccgcgc 3420 
gttgcggacc tgttcgaagc acgtcgtccg aaagagccgg caatcctggc tgaaatcagc 3480 
ggtatcgttt ccttcggtaa agaaaccaaa ggtaaacgtc gtctggttat caccccggta 354 0 
gacggtagcg atccgtacga agagatgatt ccgaaatggc gtcagctcaa cgtgttcgaa 3 600 
ggtgaacgtg tagaacgtgg tgacgtaatt tccgacggtc cggaagcgcc gcacgacatt 3660 
ctgcgtctgc gtggtgttca tgctgttact cgttacatcg ttaacgaagt acaggacgta 3720 
taccgtctgc agggcgttaa gattaacgat aaacacatcg aagttatcgt tcgtcagatg 37 8 0 
ctgcgtaaag ctaccatcgt taacgcgggt agctccgact tcctggaagg cgaacaggtt 3840 
gaatactctc gcgtcaagat cgcaaaccgc gaactggaag cgaacggcaa agtgggtgca 3900 
acttactccc gcgatctgct gggtatcacc aaagcgtctc tggcaaccga gtccttcatc 3960 
tccgcggcat cgttccagga gaccactcgc gtgctgaccg aagcagccgt tgcgggcaaa 4 020 
cgcgacgaac tgcgcggcct gaaagagaac gttatcgtgg gtcgtctgat cccggcaggt 4080 
accggttacg cgtaccacca ggatcgtatg cgtcgccgtg ctgcgggtga agctccggct 4140 
gcaccgcagg tgactgcaga agacgcatct gccagcctgg cagaactgct gaacgcaggt 4200 
ctgggcggtt ctgataacga gtaa 4224 

<210> 732 
<211> 3734 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Concatenation of S. pyogenes genes 
<220> 

<221> misc_feature 

<222> 499-598, 1049-1148, 1587-1686, 2093-2192, 2652-2751, 
3201-3301 

<223> n = A,T,C or G 
<400> 732 

aaccttaatt ggaaagaaac ccaagaagtc ggttccgttg ttgaaaaaga attgggcatt 60 
ccttttgcca ttgacaatga tgccaatgtg gctgcccttg gtgaacgttg ggtaggtgct 120 
ggtgaaaata acccagatgt cgtcttcatg acacttggaa caggtgtcgg tggaggcatt 180 
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attgctgatg gtaacttgat tcatggtgtt 
attgttgagc cagaaaatgg ctttgcttgt 
gtagcttcag caacaggagt tgtcaaagtg 
gattcagcca tcaaagcagc tattgacaat 
atggcggctg aagcagggga ttcctttgct 
cttggccttg cttcagcann nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
cttacttcga accatgaatc ttttggaagt 
gattgatatt accgataaga agaatgatat 
tttccagcag tttaacctct ttcccaatat 
aatcaaaact aagggaatgg ctaaagcaga 
caaagttgga ttatcagaaa aagccaaggc 
gcagcggatt gcgattgcgc gtggactggc 
accaacttca gctctagacc cagaaatggt 
ggctaaatct gggatgacta tggttattnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnngc atgcaatacc gcaacagcgg 
atattcctgt tttaggagtt gtcttaccgg 
aaggccaggt tggggtcatc ggaaccccaa 
aaatccagct attagcacca tctattcaag 
cgattgtgga atcaaatgag atgtgttcga 
tagcaccatt agtcggtaaa atagataccc 
tacgaccaat tatccaaaat gttatggggc 
aatgcgtccg agatatctct gtcttannnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnacct atattaatcg tttacagaaa 
ttgcaaagtt tagcagtcgt tgctgaaacc 
aatcatgtga ttacaattca agaaggtcgt 
caggaataca ttcccaatag tatctctttt 
ggtccaaata tgagtggtaa gtcgacttat 
gcccagatgg gttcatttgt ggctgctgat 
tttacgcgta ttggggctgc tgatgatttg 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
catgaacatc aatcccaaaa caagccgtaa 
agcaggtcat ggaagtgcca tgctttatag 
tgtagaagat ttaaagaact tccgtcaatg 
gaaccacaca gacggtgtcg aagcaaccac 
cgttgggatg gccatggcag aagctcatct 
catcgttgat cactacacat ttgctttgaa 
ccaagaagca gcaagtatgg caggacattt 
ttcaaacgac annnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
ttctaaaggt agataatttt ttaactcatc 
gtaaagtgtt tgctcaaaaa tatgctgagg 
cttcaggtat tgcaccagcc gtatacgctg 
cgaaaaaaca taaaaacatt accatgacag 
tcactaaaca agtgacgagc acggtgtcta 
aggttttgat tattgatgac tttttagcta 
ttattggtca agcaggggca caagtcgtcg 
aagatggtcg tcgattgatt nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nttgcagctg atagccaacg taaagcccaa 
gagattgcac ctgtcactat tcctcagcgt 
gaatacccta aatttggaac gacagtggat 
aaagatgagg ggacagtaac tgctggtaat 
attttattga tgagtaaaga aaaagctgaa 
actagttatg caagtgcagg tgtagaccca 
acgaaaaagg ctcttgcaaa ggctcagctg 
aacgaagctt ttgc 



gcaggagcag gtggtgaaat cggccacatg 24 0 
acttgtggct cacacggctg tttggaaaca 300 
gcacgtttac tggcagaagc ctacgaaggg 360 
ggtgaaggtg ttaccagtaa agacattttc 420 
gattctgttg tggaaaaggt tggttactac 480 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 54 0 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnac 600 
accaactaag ggacagatta ggtttgaggg 660 
tttcagcatg cgtgaaaaaa tgggaatggt 720 
gactatttta gaaaatatca ctttatcgcc 780 
ggctgacaaa acagccttga gcttgttgga 840 
ttatcctgct agtctttctg gtgggcaaca 900 
tatggatcca gatgttttac tctttgatga 960 
gggtgaggtc ttggctgtca tgcaagattt 1020 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1080 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1140 
tggcttggga agaagtaaaa gcagctttag 1200 
gggcaagcgc agctattaaa tcaacgacaa 12 60 
tgacagtggc ttcagacatt tatcgcaaaa 1320 
taaggagtct tgcttgcccg aagtttgtac 1380 
gtatagctaa aaaaatagtt tatgacagtc 14 4 0 
ttgtactagg atgtactcac tatcccttgt 1500 
catctgttaa gctgattgac agtggagcag 15 60 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1620 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1680 
ctggctaaaa ttttggcaac ggtagatgtt 1740 
aatcattata tccggccgca gttcaatgat 1800 
cacgcggttg ttgaaaaggt tatgggagtg 18 60 
gaccaacaga ccagtattca gctgattaca 1920 
atgagacagc tggccttaac ggttatcatg 1980 
catgttgatt tacctttatt tgatgcgatt 204 0 
atttctgggc aatcaacctt tannnnnnnn 2100 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2160 
nnctatggcc tatgttcttt ggaatcactt 2220 
ttggtcaaac agagaccgtt ttatcctatc 2280 
cttgttacac ttagctggtt atgatttatc 2340 
gggttctaaa acaccaggtc acccagaagt 2400 
aggacctctt ggtcaaggga tcgcaaatgc 24 60 
agcagctaaa tttaacaaac caggctttga 2520 
tggtgacggt gaccttatgg aaggggtcag 2580 
aaaacttggg aaattggtct tgctatatga 2640 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 27 00 
nnnnnnnnnn nnnnnnnnnn ngagagaata 27 60 
aagttgatta ccggttgatg aaagcaattg 2820 
ctggcattac aaaagtggtt acaatcgaag 2880 
cagaagcaat ggatgttcct atgatttttg 2940 
aaggcatttt gacagcagaa gtttattctt 3000 
tcgctggtaa attcctatct aaagaagaca 3060 
atggtcaggc agccaaaggc ttgattgaga 3120 
gcgttggtat tgtgattgag aaatctttcc 3180 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 32 4 0 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3300 
cttgccatag aaaaaggtcg tttcaaagaa 33 60 
aaaggtgaac ctttactcgt tgatcaagat 3420 
aagttagcaa agttacgccc tgcttttatc 3480 
gcttcaggaa tcaatgatgg agcagcggca 3540 
gaattagggc tccctatttt agctaaaatc 3600 
agtattatgg gctgcggacc aatacctgct 3660 
acaattgatg acattgattt gattgaagca 3720 
3734 
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<210> 733 
<211> 288 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bacillus anthracis 



<400> 733 

atgagtaaaa aacaacaagg ttataacaag 
aatgctagtt atggtacaga gtttgcgact 
aacgcacaat cagaagctaa gaaagcgcaa 
gctagttatg gtacagaatt tgcaactgaa 
gcacaatcag ctgcaaaaca atcacaatct 

<210> 734 
<211> 1185 
<212> DNA 

<213> Artificial Sequence 



gcaacttctg gtgctagcat tcaaagcaca 60 

gaaacaaatg tacaagcagt aaaacaagca 120 

gcttc-tggtg ctagcattca aagcacaaat 18 0 

acagacgtgc atgctgtgaa aaaacaaaat 240 

tctagttcaa atcagtaa 288 



<220> 

<223> Escherichia coli 



<400> 734 

atgtctaaag aaaagtttga acgtacaaaa 
gttgaccatg gtaaaacaac gctgaccgct 
ggcggtgctg ctcgcgcatt cgaccagatc 
atcaccatca acacttctca cgttgaatac 
gactgcccgg ggcacgccga ctatgttaaa 

ggcgcgatcc tggtagttgc tgcgactgac 
ctgctgggtc gtcaggtagg cgttccgtac 
gttgatgacg aagagctgct ggaactggtt 
tacgacttcc cgggcgacga cactccgatc 
ggcgacgcag agtgggaagc gaaaatcctg 
ccggaaccag agcgtgcgat tgacaagccg 
atctccggtc gtggtaccgt tgttaccggt 
gaagaagttg aaatcgttgg tatcaaagag 
atgttccgca aactgctgga cgaaggccgt 
ggtatcaaac gtgaagaaat cgaacgtggt 
ccgcacacca agttcgaatc tgaagtgtac 
actccgttct tcaaaggcta ccgtccgcag 
accatcgaac tgccggaagg cgtagagatg 
gttaccctga tccacccgat cgcgatggac 
ggccgtaccg ttggcgcggg cgttgtagca 

<210> 735 
<211> 2856 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Escherichia coli 



ccgcacgtta acgtcggtac tatcggccac 60 

gcaatcacta ccgtactggc taaaacctac 120 

gataacgcgc cggaagaaaa agctcgtggt 180 

gacQccccga cccgtcacta cgcacacgta 240 

aacatgatca ccggtgctgc gcagatggac 300 

ggcccgatgc cgcagactcg tgagcacatc 360 

atcatcgtgt tcctgaacaa atgcgacatg 420 

gaaatggaag ttcgtgaact tctgtctcag 480 

gttcgtggtt ctgctctgaa agcgctggaa 540 

gaactggctg gcttcctgga ttcttacatt 60 0 

ttcctgctgc cgatcgaaga cgtattctcc 660 

cgtgtagaac gcggtatcat caaagttggt 720 

actcagaagt ctacctgtac tggcgttgaa 780 

gctggtgaga acgtaggtgt tctgctgcgt 840 

caggtactgg ctaagccggg caccatcaag 900 

attctgtcca aagatgaagg cggccgtcat 960 

ttctacttcc gtactactga cgtgactggt 1020 

gtaatgccgg gcgacaacat caaaatggtt 1080 

gacgg-tctgc gtttcgcaat ccgtgaaggc 1140 

aaagt-tctga gctaa 1185 



<400> 735 

atggaaaaga catataaccc acaagatatc 
cagggctact ttaagcctaa tggcgatgaa 
ccgccgaacg tcaccggcag tttgcatatg 
accatgatcc gctatcagcg catgcagggc 
cacgccggga tcgctaccca gatggtcgtt 
acccgtcacg actacggccg cgaagctttc 



gaacagccgc tttacgagca ctgggaaaag 60 

agccaggaaa gtttctgcat catgatcccg 120 

ggtcacgcct tccagcaaac catcatggat 180 

aaaaacaccc tgtggcaggt cggtactgac 240 

gagcgcaaga ttgccgcaga agaaggtaaa 300 

atcgacaaaa tctgggaatg gaaagcggaa 360 
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tctggcggca ccattacccg tcagatgcgc 
gaacgcttca ccatggacga aggcctgtcc 
tataaagaag acctgattta ccgtggcaaa 
accg-ctatct ctgacctgga agtcgaaaac 
cgctatccgc tggctgacgg tgcgaaaacc 
actacccgtc cagaaaccct gctgggcgat 
cgttacaaag atctgattgg caaatatgtc 
atcg-ttggcg acgaacacgc cgacatggaa 
gcgcacgact ttaacgacta tgaagtgggt 
ctgacctttg acggcgatat ccgtgaaagc 
tctg-acgttt attccagcga aatccctgca 
cgtaaagcag tcgttgccgc agttgacgcg 
gacctgaccg ttccttacgg cgaccgtggc 
cagtggtacg tgcgtgccga tgtcctggcg 
gacattcagt tcgtaccgaa gcagtacgaa 
cagg-actggt gtatctctcg tcagttgtgg 
gaag-cgggta acgtttatgt tggccgcaac 



cgtctcggca actccgtcga ctgggagcgt 420 

aatgcggtga aagaagtttt cgttcgtctg 480 

cgcctggtaa actgggatcc gaaactgcgc 540 

cgcgaatcga aaggttcgat gtggcacatc 600 

gcagacggta aagattatct ggtggtcgcg 660 

actggcgtag ccgttaaccc ggaagatccg 720 

attctgccgc tggttaaccg tcgtattccg 780 

aaaggcaccg gctgcgtgaa aatcactccg 840 

aaacgtcacg ccctgccgat gatcaacatc 900 

gcccaggtgt tcgataccaa aggtaacgaa 960 

gagttccaga aactggagcg ttttgctgca 1020 

cttggcctgc tggaagaaat taaaccgcac 1080 

ggcgtagtta tcgaaccaat gctgaccgac 1140 

aaaccggcgg ttgaagcggt tgagaacggc 1200 

aacatgtact tctcctggat gcgcgatatt 1260 

tggggtcacc gtatcccggc atggtatgac 1320 

gaagacgaag tgcgtaaaga aaataacctc 1380 



ggtgctgatg ttgtcctgcg tcaggacgaa 
ctgtggacct tctctaccct tggctggccg 
ccaaccagcg tgatggtatc tggtttcgac 
atga-fcgacca tgcacttcat caaagatgaa 
gtttacatga ccggcctgat tcgtgatgac 
aacg-ttatcg acccactgga tatggttgac 
cgtaccggca atatgatgca gccgcagctg 
cagt-tcccga acggtattga gccgcacggt 
ctggcgtcta ccggtcgtga catcaactgg 
ttctgtaaca agctgtggaa cgccagccgc 
tgcg-gcttca acggcggcga aatgacgctg 
ttcaaccaga ccatcaaagc gtaccgcgaa 
gcaggcattc tgtatgagtt cacctggaac 
aagccggtaa tgaacggtgg caccgaagca 
actg-tactgg aaggtctgct gcgcctcgcg 
atctggcagc gtgtgaaagt actttgcggt 
ttcc cgcagt acgatgcatc tcaggttgat 
aaacaggcga tcgttgcggt acgtaacatc 
ccgc-tggagc tgctgctgcg tggttgcagc 
cgtggcttcc tgcaaaccct ggcgcgtctg 
aaaggtccgg tttccgttac gaagatcatc 
ggcc-tcatca acaaagaaga tgagctggcg 
ggtgaaatca gccgtatcga gaacaaactg 
gaagcggtca tcgcgaaaga gcgtgagaag 
ctga-ttgaac agcaggctgt tatcgccgcg 

<210> 736 
<211> 1770 
<212> DNA 

<213> Artificial Sequence 



gacgttctcg atacctggtt ctcttctgcg 14 40 
gaaaataccg acgccctgcg tcagttccac 1500 
atcattttct tctggattgc ccgcatgatc 1560 
aatggcaaac cgcaggtgcc gttccacacc 1620 
gaaggccaga agatgtccaa atccaagggt 1680 
ggtatttcgc "tgccagaact gctggaaaaa 17 40 
gcggacaaaa tccgtaagcg caccgagaag 1800 
actgacgcgc tgcgcttcac cctggcggcg 18 60 
gatatgaagc gtctggaagg ttaccgtaac 1920 
tttgtgctga "tgaacacaga aggtcaggat 1980 
tcgctggcgg accgctggat tctggcggag 2040 
gcgctggaca gcttccgctt cgatatcgcc 2100 
cagttctgtg actggtatct cgagctgacc 2160 
gaactgcgcg gtactcgcca tacgctggtg 2220 
catccgatca "ttccgttcat caccgaaacc 2280 
atcactgccg acaccatcat gctgcagccg 2340 
gaagccgcac "tggccgacac cgaatggctg 24 00 
cgtgcagaaa "tgaacatcgc gccgggcaaa 24 60 
gcggatgcag aacgtcgcgt aaatgaaaac 2520 
gaaagtatca ccgtgctgcc tgccgatgac 2580 
gacggtgcag agctgctgat cccgatggct 2640 
cgtctggcga aagaagtggc gaagattgaa 27 00 
gcgaacgaag gctttgtcgc ccgcgcaccg 27 60 
ctggaaggct atgcggaagc gaaagcgaaa 2820 
ctgtaa 2856 



<220> 

<223:> Escherichia coli 



<400> 736 

atgcgtacag aatattgtgg acagctccgt 
tgtggttggg tcaaccgtcg tcgtgatctt 
cgcgsaggta tcgtgcaggt atttttcgat 
tctgaactgc gtaatgagtt ctgcattcag 
aaaaatatta accgcgatat ggcgaccggc 
atca-tcaacc gcgcagatgt tctgccgctt 
cgtc-tgaaat accgctacct cgacctgcgt 
cgcgctaaaa tcaccagcct ggtgcgccgt 



ttgtcccacg "tggggcagca ggtgactctg 60 

ggtagcctga tcttcatcga tatgcgcgac 120 

ccggatcgtg cggacgcgtt aaagctggcc 180 

gtcacgggca ccgtacgtgc gcgtgacgaa 240 

gaaatcgaag -tgctggcgtc ctcgctgact 300 

gactctaacc acgtcaacac cgaagaagcg 360 

cgtccggaaa tggctcagcg cctgaaaacc 420 

tttatggatg accacggctt cctcgacatc 4 80 
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gaaactccga tgctgaccaa agccacgccg gaaggcgcgc gtgactaact ggtgccttct 540 

cgtgtgcaca aaggtaaatt ctacgcactg ccgcaatccc cgcagttgtt caaacagctg 600 
ctgatgatgt ccggttttga ccgttactat cagatcgtta aatgcttccg tgacgaagac 660 
ctgcgtgctg accgtcagcc tgaatttact cagatcgatg tggaaacttc tttcatgacc 720 

gcgccgcaag tgcgtgaagt gatggaagcg ctggtgcgtc atctgtggct ggaagtgaag 780 
ggtgtggatc tgggcgattt cccggtaatg acctttgcgg aagcagaacg ccgttatggt 840 

tctgataaac cggatctgcg taacccgatg gaactgactg acgttgctga tctgctgaaa 900 
tctgttgagt ttgctgtatt tgcaggtccg gcgaacgatc cgaaaggtcg cgtagcggct 960 

ctgcgcgttc cgggcggcgc atcgctgacc cgtaagcaga tcgacgaata cggtaacttc 1020 
gttaaaatc-fc acggcgcgaa aggtctggct tacatcaaag ttaacgaacg cgcgaaaggt 1080 

ctggaaggta tcaacagccc ggtagcgaag ttccttaatg cagaaatcat cgaagacatc 1140 

ctggatcgta ctgccgcgca agatggcgat atgattttct tcggtgccga caacaagaaa 1200 

attgttgcccj acgcgatggg tgcactgcgc ctgaaagtgg gtaaagacct tggtctgacc 12 60 

gacgaaagca aatgggcacc gctgtgggtt atcgacttcc cgatgtttga agacgacggt 1320 

gaaggcggcc tgacggcaat gcaccatccg ttcacctcac cgaaagatat gacggctgca 1380 

gaactgaaag ctgcaccgga aaatgcggtg gcgaacgctt acgatatggt catcaatggt 14 4 0 

tacgaagtgg gcggtggttc agtacgtatc cataatggtg atatgcagca gacggtgttt 1500 

ggtattctgg gtatcaacga agaggaacag cgcgagaaat tcggcttcct gctcgacgct 1560 

ctgaaatacg gtactccgcc gcacgcaggt ctggcattcg gtcttgaccg tctgaccatg 1620 

ctgctgaccg gcaccgacaa tatccgtgac gttatcgcct tcccgaaaac cacggcggca 1680 

gcgtgtctga tgactgaagc accgagcttt gctaacccga ctgcactggc tgagctgagc 1740 
attcaggttg tgaagaaggc tgagaataac 1770 

<210> 737 
<211> 3699 
<212> DNA 

<213> Art±ficial Sequence 
<220> 

<223> Yersinia pestis 
<400> 737 

atgatggttt tccagccaat cagtgagttt ctcttgataa ggaatgcggg aatgtctatg 60 

tattttaata aaataatttc atttaatatt atttcacgaa tagttatttg tatctttttg 120 

atatgtggaa tgttcatggc tggggcttca gaaaaatatg atgctaacgc accgcaacag 180 

gtccagcctt attctgtctc ttcatctgca tttgaaaatc tccatcctaa taatgaaatg 240 

gagagttcaa tcaatccctt ttccgcatcg gatacagaaa gaaatgctgc aataatagat 300 

cgcgccaata aggagcagga gactgaagcg gtgaataaga tgataagcac cggggccagg 360 

ttagctgcat caggcagggc atctgatgtt gctcactcaa tggtgggcga tgcggttaat 420 

caagaaatca aacagtggtt aaatcgattc ggtacggctc aagttaatct gaattttgac 480 

aaaaattttt cgctaaaaga aagctctctt gattggctgg ctccttgg-ta tgactctgct 540 

tcattcctct tttttagtca gttaggtatt cgcaataaag acagccgcaa cacacttaac 600 

cttggcgtcg ggatacgtac attggagaac ggttggctgt acggacttaa tactttttat 660 

gataatgatt tgaccggcca caaccaccgt atcggtcttg gtgccgaggc ctggaccgat 720 

tatttacagt tggctgccaa tgggtatttt cgcctcaatg gatggcactc gtcgcgtgat 780 

ttctccgact ataaagagcg cccagccact gggggggatt tgcgcgcgaa tgcttattta 840 

cctgcactcc cacaactggg ggggaagttg atgtatgagc aatacaccgg tgagcgtgtt 900 

gctttattgt gcccagaaaa cccccagcta ggctgggggt tcagtaaagc tttcagcttt 960 

gggtcagtta taaaaacccc ttttgatttg ttaaaacagt ttgcggtctg gcaactgcaa 1020 

atgttcaaca agaaatcaaa agggggtccc aatgagggat gaaaagagct tagcgcacac 108 0 

ccgatggaac tgtaaatatc atatagtttt tgcgccgaag taccgaaggc aggtgttcta 114 0 

cagggaaaaa cgcagagcga ttggcagtat tttaagaaaa ctgtgcgaat ggaaaaacgt 1200 

gaatatcctg gaagcagaat actgtgtgga tcacatccat atgcttctgg agatcccgcc 1260 

caagatgagt gtctcgggat ttatggggta cctgaaggga aagagcag-hc tgatgcttta 1320 

tgagcagttt ggcgatttga agttcaaata ccgtaacagg gagttttggt gtcgagggta 1380 

ttacgttgat acggtaggga aaaacacggc caggatacaa gaatacataa agcaccaatt 1440 

ggaagaggat aaaatgggtg agcaactctc gatcccgtat cccggtagcc cgtttacggg 1500 

ccgtaagtaa tccatagatg caaatgtcag atcgcgatgc gcctgttagg gcgcggctgg 1560 

taacagagcc ttataggcgc atatgaaaaa cctccggcta tgccggagga tatttatttt 1620 

ggtaaagata atctgcaacg caacccttat gccgtgactg ccgggatcaa ttacaccccc 1680 

gtgcctctac tcactgtcgg ggtagatcag cgtatgggga aaagcagtaa gcatgaaaca 1740 
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cagtggaacc 
tcagcggtgg 
aataatatcg 
actatctccg 
gctgtaaggg 
ccactgagta 
agtcgggtaa 
caccaaggaa 
acattaacgg 
gttgagttca 
accaccaata 
gcgcgcattg 
gggcaacggc 
actctggctg 
ttggagttga 
accttaggca 
accagtacca 
gtgccgagtg 
ttcaccgtct 
gtccctgtcg 
aacggtgtgc 
gtggttggga 
agtacattgc 
aacgggcaaa 
ttccagttac 
ttcacaccca 
agcgaagtgg 
ttctacccaa 
agcagacaat 
ggaacgcgtg 
tctgattggc 
aatatgaaca 
ctgtcaatat 



tccaaatgaa 
cgggaacacg 
tgttggagta 
gcctgccggg 
aaattgtctg 
ccacacaatt 
cggacgacct 
acagatctaa 
cggccgtcat 
ccgttgctga 
atggtgcgct 
cattaaccaa 
aaagtgttga 
cggtaccgac 
aggataccta 
atatgggcgt 
cgttgggggt 
tgacggttaa 
ccacaccgga 
ataagaatgg 
cggtgagtat 
ataccgccgg 
agaaaaaaat 
atttcgctac 
agatggataa 
atgtatcggt 
ctgtgacggc 
atcggtggat 
gccaaggttc 
cgcctgacgg 
aatctggcga 
ctggcctgct 
aaccagataa 



ctatcgcctg 
tctactggct 
tcagaaacag 
tcaggtttat 
gagtgatgcc 
caacttggtt 
gacagccaac 
ctcattcaca 
tggtgatggc 
ttttgagggg 
accgaataaa 
tacgacagat 
tacccacttt 
atctatcatc 
tggggacccg 
tatcacggat 
agcaecagta 
tttcacggca 
tatcttggct 
ccattttatc 
tagccccatt 

atccctattc 
ggataaaggg 
cgatgttgct 
taacgatcag 
gaaaagtaaa 
atacgatggc 
agatatgtct 
gacattgtgg 
atattgggtc 
gcaaccaggg 
cagatagcaa 



ggcgagagtt 
gagagccgct 
caggtggtta 
caggtgaacg 
gaactgattg 
ttaccgcctt 
ttttattcgc 
ttgagcgtca 
gcaccggcta 
aaacccttag 
atcacggaaa 
ggcgtgacgg 
gttaagggta 
gctgatggtc 
caggctggcg 
cacaatgacg 
acggtgaaag 
gatcctattc 
gatggcacga 
agtgggatgc 
accgagcagc 

ccggtaccta 
ttcccgaaaa 
aataatactc 
ggtcaggtga 
aaattcccaa 
ggcacttcgc 
gcggntcttg 
ggcgagtggg 
aaaaggacca 
cctgcatact 
taagaacag 



ttcagtcgca 
ataaccttgt 
aactgacatt 
cacaagtaca 
ccgctggcgg 
ataaacgcac 
ttagtgcgct 
ccgttcagca 
gtgggaaaac 
ccgggcagga 
agacagatgc 
tagtcacagc 
ctatcgcggc 
taatggcttc 
cgaatgtggc 
gcacttatag 
tggatggggc 
cagatgctgg 
tgagttccac 
agggcttgag 
cagatagcta 
tggttgatac 
cgctgaccgg 
cgatctttaa 
agtatgagtg 
cgattaccta 
gttattcggt 
tggtatcgag 
aatcctcacg 
ggagcttgac 
gcacggattt 
tggcgttccc 



acttagccct 
cgatcgtaac 
atcgccagca 
aggggcatct 

agcacaagtg 
cgcggttgat 
gcctcagttg 
tgcaatcacc 
agtggtgata 
aaatggcgtc 
agaagtggag 
ggataaatcc 
aaccatcacg 
ttttgacaca 
cgcaccattg 
tgcgttcagt 
ccgctccagt 
attatccttt 
ttttactcaa 
taccgcgacg 
cctgatactg 
tattctggtt 
aaacgccaca 
gtcgtcgtca 
ccaaacctat 
gagttatcgg 
tatcgaggcc 
tgcaaccaac 
cgcgtatagt 
tgaaaccatg 
gctctgtgcg 



1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2B80 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3699 



<210> 738 
<211> 3891 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Clostridium botulinum 



<400> 738 

atgcaatttg ttaataaaca 
tatataaaaa ttccaaatgt 
aaaatatggg ttattccaga 
ccaccaccag aagcaaaaca 
gataatgaaa aagataatta 
actgatcttg gaagaatgtt 
agtacaatag atacagaatt 
gatggtagtt atagatcaga 
atacagtttg aatgtaaaag 
ggctctactc aatacattag 
gaagttgata caaatcctct 
ttagcacatg aacttataca 

agggttttta aagtaaatac 
gaggaactta gaacatttgg 
gaatttcgtc tatattatta 
aaatcaatag taggtactac 
tatctcctat ctgaagatac 



atttaattat aaagatcctg 
aggacaaatg caaccagtaa 

aagagataca tttacaaatc 

agttccagtt tcatattatg 

tttaaaggga gttacaaaat 

gttaacatca atagtaaggg 

aaaagttatt gatactaatt 

agaacttaat ctagtaataa 

ctttggacat gaagttttga 

atttagccca gattttacat 

tttaggtgca ggcaaatttg 

tgctggacat agattatatg 

taatgcctat tatgaaatga 

gggacatgat gcaaagttta 

-taataagttt aaagatatag 

tgcttcatta cagtatatga 

atctggaaaa ttttcggtag 



taaatggtgt tgatattgct 60 

aagcttttaa aattcataat 120 

ctgaagaagg agatttaaat 180 

attcaacata tttaagtaca 240 

tatttgagag aatttattca 300 

gaataccatt ttggggtgga 360 

gtattaatgt gatacaacca 420 

taggaccctc agctgatatt 4 80 

atcttacgcg aaatggttat 540 

ttggttttga ggagtcactt 600 

ctacagatcc agcagtaaca 660 

gaatagcaat taat.ccaaat 720 

gtgggttaga agtaagcttt 780 

tagatagttt acaggaaaac 840 

caagtacact taataaagct 900 

aaaatgtttt taaagagaaa 960 

ataaattaaa atttgataag 1020 
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ttatacaaaa tgttaacaga , gatttacaca 
cttaacagaa aaacatattt gaattttgat 
aaggtaaatt acacaatata tgatggattt 
tttaatggtc aaaatacaga aattaataat 
ggattgtttg aattttataa gttgctatgt 
tcattagata aaggatacaa taaggcatta 
gacttgtttt ttagtccttc agaagataat 
attacatctg atactaatat agaagcagca 
caatattatt taacctttaa ttttgataat 
tcaagtgaca ttataggcca attagaactt 
aaaaagtatg agttagataa atatactatg 
catggtaaat ctaggattgc tttaacaaat 
cgtgtttata catttttttc ttcagactat 
gctatgtttt taggctgggt agaacaatta 
gtaagtacta cggataaaat tgcggatata 
ttaaatatag gtaatatgtt atataaagat 
gctgttattc tgttagaatt tataccagag 
cttgtatcat atattgcgaa taaggttcta 
aaaagaaatg aaaaatggga tgaggtctat 
gttaatacac agattgatct aataagaaaa 
gaagcaacaa aggctataat aaactatcag 
aatattaatt ttaatattga tgatttaagt 
atgattaata taaataaatt tttgaatcaa 
atcccttatg gtgttaaacg gttagaagat 
aagtatatat atgataatag aggaacttta 
gttaataata cacttagtac agatatacct 
agattattat ctacatttac tgaatatatt 
ttaagatatg aaagtaatca tttaatagac 
ggtagtaaag taaattttga tccaatagat 
agtagtaaaa ttgaggtaat tttaaaaaat 
tttagtacta gcttttggat aagaattcct 
gaatatacaa taataaattg tatggaaaat 
ggtgaaataa tctggacttt acaggatact 
tacagtcaaa tgattaatat atcagattat 
aataatagat taaataactc taaaatttat 
atttcaaatt taggtaatat tcatgctagt 
agagatacac atagatatat ttggataaaa 
gaaaaagaaa tcaaagattt atatgataat 
tggggtgatt atttacaata tgataaacca 
aaatatgtcg atgtaaataa tgtaggtatt 
ggtagcgtaa tgactacaaa catttattta 
attataaaaa aatatgcttc tggaaataaa 
tatattaatg tagtagttaa aaataaagaa 
ggcgtagaaa aaatactaag tgcattagaa 
gtagtaatga agtcaaaaaa tgatcaagga 
gataataatg ggaatgatat aggctttata 
ctagtagcaa gtaattggta taatagacaa 
tcatgggaat ttattcctgt agatgatgga 

<210> 739 
<211> 9047 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Staphylococcus aureus 
<400> 739 

aagcttcttg tgaaaaaata tcttcaatct 
ttgagacaag tgtttttcca atttcacccg 
tattaataat aagcccagca ccaatacctt 



gaggataatt ttgttaagtt ttttaaagta 1080 
aaagccgtat ttaagataaa tatagtacct 1140 
aatttaagaa atacaaattt agcagcaaac 1200 
atgaatttta ctaaactaaa aaattttact 12 60 
gtaagaggga taataacttc taaaactaaa 1320 
aatgatttat gtatcaaagt taataattgg 1380 
tttactaatg atctaaataa aggagaagaa 1440 
gaagaaaata ttagtttaga tttaatacaa 1500 
gaacctgaaa atatttcaat agaaaatctt 1560 
atgcctaata tagaaagatt tcctaatgga 1620 
ttccattatc ttcgtgctca agaatttgaa 1680 
tctgttaacg aagcattatt aaatcctagt 1740 
gtaaagaaag ttaataaagc tacggaggca 1800 
gtatatgatt ttaccgatga aactagcgaa 1860 
actataatta ttccatatat aggacctgct 1920 
gattttgtag gtgctttaat attttcagga 198 0 
attgcaatac ctgtattagg tacttttgca 2 040 
accgttcaaa caatagataa tgctttaagt 2100 
aaatatatag taacaaattg gttagcaaag 2160 
aaaatgaaag aagctttaga aaatcaagca 2 220 
tataatcaat atactgagga agagaaaaat 2 280 
tcgaaactta atgagtctat aaataaagct 2 340 
tgctctgttt catatttaat gaattctatg 2 400 
tttgatgcta gtcttaaaga tgcattatta 2 4 60 
attggtcaag tagatagatt aaaagataaa 2 520 
tttcagcttt ccaaatacgt agataatcaa 2 58 0 
aagaatatta ttaatacttc tatattgaat 2 64 0 
ttatctaggt atgcatcaaa aataaatatt 2 700 
aaaaatcaaa ttcaattatt taatttagaa 2 760 
gctattgtat ataatagtat gtatgaaaat 2 820 
aagtatttta acagtataag tctaaataat 2 880 
aattcaggat ggaaagtatc acttaattat 2 94 0 
caggaaataa aacaaagagt agtttttaaa 3O00 
ataaacagat ggatttttgt aactatcact 3O60 
ataaatggaa gattaataga tcaaaaacca 312 0 
aataatataa tgtttaaatt agatggttgt 3180 
tattttaatc tttttgataa ggaattaaat 3 240 
caatcaaatt caggtatttt aaaagacttt 3 300 
tactatatgt taaatttata tgatccaaat 3 360 
agaggttata tgtatcttaa agggcctaga 3 420 
aattcaagtt tgtatagggg gacaaaattt 3 480 
gataatattg ttagaaataa tgatcgtgta 3 540 
tataggttag ctactaatgc atcacaggca 3 600 
atacctgatg taggaaatct aagtcaagta 3 660 
ataacaaata aatgcaaaat gaatttacaa 3720 
ggatttcatc agtttaataa tatagctaaa 3 7 80 
atagaaagat ctagtaggac tttgggttgc 3 840 
tggggagaaa ggccactgta a 3 8 91 



tatgaaagat ctccacatta tcgctaactt 60 
cttccccatt tgcaccacga tacaattgat 12 0 
tatgtatact taaagcaata agattattgt 180 



149 



WO 2005/098047 



PCT/US2005/005356 



aggataaatt atgattaaaa ttacgttcat 
ctacgactgg aacattagta atttctttta 
catggaatgg taaatatgtc acatgctgct 

acacacctaa tagtccgtta taagtatcaa 
ttatgcttaa tatactacta accttttcat 
cattaccatc aaaataattg tacatcactt 
taaaataacc ataaagatga ttaaccttca 
tgctatcacc ctcaccaacc tcattaacaa 
tagaaatcgt tgccttatta actcttttaa 
taaatttccc ttagcactct tttttcatta 
accccataag ttattttgaa tttagtttat 
ttaacataaa ttaaattaaa ccttagaaca 
gttcttttat tttttttatt atttcaataa 
tgcaaagtcg ttgtatgttt ttcaaatttt 
gggaattttt aacatttttt atttgtttga 
aggtatgcat atggatttca ctggtgttat 
cattcaggct ttcggttaat tttttcaact 
ttgtttcctc tgttttctat gtattgttgt 
tatttctcaa ttcttctatt tcatcttgtg 
caagtgaatt gaaaccgcct ttgtatactt 
ttatatcact ttcttctaca agagagtaat 
ctataaatcc ctttttatac aatctcgtta 
ccttttgcat ttgtatttct tctattatat 
tgatattcat aacttcccat tctgcagatg 
taattccatt tcttttaaaa tacgctcaga 
gtcttcgcct tttaaatgtg tagcaaaata 
tacgaaccat ccatttgctt ctttgtgatt 
tttataagta tctttttgtt tcaaagtcat 
atcaaaatgc atgttatgtt gtttcatatt 
aatttttaat gaagattcat tccaataatt 
ttcaattaga tctaaataag atttaacctc 
gtaataccaa tttactgaat atttcattga 
tttaaatgga tattgatgtt tatcccattg 
ttggtcgaat gccattaacg ctaaataaat 
tttactttct ggttcattat aaatagaata 
acttccatca aatcctttga aatacggagc 
tgtttcatat ttgtcttgtt gaacatgtgc 
tatactacat acaatatacg caacaatacg 
ataaagtgca atatacttaa cacgttcttt 
tgctgccaca ttatttatgt gctgagattt 
ataacgtata tgttcatggc gattcaaaat 
acagtcattg tccatcattg ttttacttat 
tttaaaaaca acataaagct ggttgaatat 
atgtagaata atatattcaa tttctttgtc 
tacaatttgg gatttcacta aaccaaatac 
aattgttata tgctttttgt agaactgcat 
aagatatgac gattccaatg acgaactttt 
catataaaat aagcacaaca aactaccaaa 
atttgaggtc tcaaactgat taacattaat 
accatctaac atatgactaa ccgaagaagt 
ttttgaaaat gtaaaattcg atattttgta 
gagcactaat aaccaaatct tatgtgacat 
tctcactaaa aaaattacac atatcgtgag 
agatgataac accttctaca cctccatatc 
tactacattt gtaatatact acaaatgtag 
ataaaaattg ttccacttat tttaatagtt 
gcttcaaaag ataaagaaat taataatact 
caagtttata aagatagcag ttatatttct 
gaacgtccga taaaaatata taatagttta 
aaaataaaaa aagtatctaa aaataaaaaa 



ataacgctga aagattcgct tcattttcaa 240 
ttttcttagc aattgaaatt ccttcagttt 300 
cattatccac aactccatgt atagaaacag 360 

gtttctcctg aatatcaata tgttttttta 420 
caggtaaatc ataagattca tgcttaatga 480 
caacagaac-t ataagttaaa tccaaagaaa 540 
gaagaatagg ttttctacca ccactcttcg 600 
gagatttata ctttaactta ttcaaaatac 660 
gatatttgga cgcgaaatat tatgatggta 72 0 
tcatttatat tttattttcc ataattgcct 780 
agtttatca-t gtatttttat accaactatt 84 0 
acttaaaacc tcacttaggt tatcatcttc 900 
attacgtat-fc tccaatatga cgatttttta 960 
tgacattta-t gcaatctaaa tagccaatta 1020 
tattatact-t aatgtatctt aaatagaaag 1080 
tacaagcat-t attgatttaa tcaagacttg 1140 
aaaaaacaga ggaaatattc aacgacttga 1200 
aaacacaaca attttatttt ttattcaata 12 60 
atagatcttc tttttctaca aagtttaaga 1320 
tattgataaa gtttttagat gttttatatt 1380 
attgaaaaa-fc tttattgtct tttttacgat 1440 
taagtgtacg aatggttttt ggactccagt 1500 
tattcgcac-fc tgcatatttt ttcatccaaa 1560 
atatttcata cgttttatta tccattatat 1620 
aatttgttg-fc gctttttcgc cattcgcatt 1680 
atacgtatta tctttcgttt caacataacc 1740 
cacgattcct gttccagttt tacctacata 1800 
actattttca actttttcaa tagccttatt 1860 
tttcaacaaa ttaacctgtt ctattgcaga 1920 
ttcattccct gatatttctt cattaccata 1980 
atcttgtctt aaatgtttgt ttaaattttc 204 0 
agaatttaaa. ttttgatctt ggttccattc 2100 
ttgttcagta tgatttaatg agagtaaatt 2160 
tttgtaagta gaattaggtg aatatcgttg 2220 
agcttgctcc cgttcattat aaagcacaaa 2280 
tagttgattt aattttttat atgatacatt 2340 
agatagtaac ggtgcttgta ttaaaagcga 2400 
cttgtttcga. ttaggtttag gcattgaatc 2460 
aatatttgaa. ttaaaaccta gtaaatattg 2520 
taaaatagag cattttaata tcgattcacc 2580 
ttttaaaacg tttctatcac atactttttc 2640 
atatagtgca ggattaaacc agaatatcat 2700 
taagtcatga cttttcacat gtgatagttc 2760 
attcatggtt tcgactacga cagttggtag 2 82 0 
catcggatta. tcaatgtttg aactataact 2880 
cttactttga catactttaa gtcgttcatt 2940 
aataacatca atttgtcgga atgccttaat 3000 
tacccatatc aaaagaatca tatacgttat 3060 
tgctaagtct ttcgtaacag atgattgttg 3120 
cgtgtcagat acatttcgat tcatcatatc 3180 
aaatggtatt aatggaatta acgtggagac 324 0 
aatattttga. gtatatttta tatagagcat 3300 
caatgaactg- attatactta acattaaaaa 3360 
acaaaaatta taacattatt ttgacataaa 3420 
tcttatataa ggaggatatt gatgaaaaag 3480 
gtagttgtcg ggtttggtat atatttttat 3540 
attgatgcaa. ttgaagataa aaatttcaaa 3600 
aaaagcgata. atggtgaagt agaaatgact 3660 
ggcgttaaag atataaacat tcaggatcgt 3720 
cgagtagatg ctcaatataa aattaaaaca 37 80 
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aactacggta acattgatcg caacgttcaa 
aagttagatt gggatcatag cgtcattatt 
attgaaaatt taaaatcaga acgtggtaaa 
aatacaggaa cacatatgag attaggcatc 
aaagcaatcg ctaaagaact aagtatttct 
attgggtaca agatgatacc ttcgttccac 
ttaagtgatt tcgcaaaaaa atttcatctt 
cctctagaaa aagcgacttc acatctatta 
ttaaaacaaa aagaatataa aggctataaa 
gaaaaacttt acgataaaaa gctccaacat 
gataatagca atacaatcgc acatacatta 
attcaactaa ctattgatgc taaagttcaa 
tatggctcag gtactgctat ccaccctcaa 
ccttcatatg acgtctatcc atttatgtat 
accgaagata aaaaagaacc tctgctcaac 
actcaaaaaa tattaacagc aatgattggg 
agttataaaa tcgatggtaa aggttggcaa 
acaagatatg aagtggtaaa tggtaatatc 
aacattttct ttgctagagt agcactcgaa 
aaaaaactag gtgttggtga agatatacca 
tcaaacaaaa atttagataa tgaaatatta 
atactgatta acccagtaca gatcctttca 
attaacgcac ctcacttatt aaaagacacg 
tccaaagaaa atatcaatct attaaatgat 
aaagaagata tttatagatc ttatgcaaac 
aaaatgaaac aaggagaaag tggcagacaa 
aatccaaaca tgatgatggc tattaatgtt 
tacaatgcca aaatctcagg taaagtgtat 
tacgatatag atgaataaca aaacagtgaa 
ttattatgaa ttattaataa gtgctgttac 
attgtatgtt gaaagtgaca ctgtaacgag 
tgtaatttca gcgataacgt acaatgtatt 
gttattcatt tgtgttcctg ctacaacttc 
tttaaatgat attgaaagtg tatgcatgcc 
ttctgctttt tctttatcta tatgcatata 
aatttcttct tctgtaatat gaaggctttt 
atcgtatttc atatatgtct ctctttctta 
taaaggtaag tctaccgtca ctgaaacgta 
atgcaccatt ttatctgcat tgtctgtaaa 
gtttgcacgt gctggtttgt ttacagtcca 
catttttact tttgctttag taagtttggc 
gtaatctaaa agtgttctcc agtcttcacg 
gttatattat ggcatgattt cttctgcaag 
agctcttctt gattctgatt taagtttgtt 
ttagaaagtg ctagtccatt cggtccagta 
ttatattcat ttgctatttt tactacatca 
ttttcaccaa accaagatcc tgcagaagca 
atttccccgg acatatttgt agtccgttct 
tcatcttttg taattgcaac atctaactcc 
gctttaaatg atgcaattgt attttccgga 
acagttagca tattacctct ccttgcattt 
acattaatcg cacttttatt tccattaaaa 
tagattcgta ttgattatgg agttaatcta 
tgactcccaa gctccatttt gtaatcaagt 
ttacttagga ttgcttttaa cttacccctt 
gatcacctct tatctgcaat tttgcctaga 
tttactgaga actgctttta acttacccct 
acgtacctct tatctgcaat tttactgaga 
tttacttaga actgctttta agctacctct 
aacaaacctc ttatctgcaa ttttacttag 
ttttacttag aattgctttt actattcctc 



tttaatt-fctg ttaaagaaga tggtatgtgg 3840 

ccaggaatgc agaaagacca aagcatacat 3900 
attttagacc gaaacaatgt ggaattggcc 3960 

gttccaaaga atgtatctaa aaaagattat 4020 

gaagactata tcaacaacaa atggatcaaa 4080 

tttaaaaccg ttaaaaaaat ggatgaatat 4140 

acaactaatg aaacagaaag tcgtaactat 4200 

ggttatgttg gtcccattaa ctctgaagaa 4260 

gatgatgcag ttattggtaa aaagggactc 4320 

gaagatggct atcgtgtcac aatcgttgac 4380 

atagagaaaa agaaaaaaga tggcaaagat 4440 

aagagtattt ataacaacat gaaaaatgat 4500 

acaggtgaat tattagcact tgtaagcaca 4560 

ggcatgagta acgaagaata taataaatta 4620 

aagttccaga ttacaacttc accaggttca 4680 

ttaaataaca aaacattaga cgataaaaca 474 0 

aaagataaat cttggggtgg ttacaacgtt 4800 

gacttaaaac aagcaataga atcatcagat 4860 

ttaggcagta agaaatttga aaaaggcatg 4 920 

agtgattatc cattttataa tgctcaaatt 4980 

ttagctgatt caggttacgg acaaggtgaa 5040 

atctatagcg cattagaaaa taatggcaat 5100 

aaaaacaaag tttggaagaa aaatattatt 5160 

ggtatgcaac aagtcgtaaa taaaacacat 5220 

ttaattgcjca aatccggtac tgcagaactc 528 0 

attgggtggt ttatatcata tgataaagat 5340 

aaagatgtac aagataaagg aatggctagc 5400 

gatgagctat atgagaacgg taataaaaaa 54 60 

gcaatccgta acgatggttg cttcactgtt 552 0 

ttctccctta aatacaattt cttcattttc 558 0 

tccattttct ttttttatgg atttcttatt 5640 

acctgggtat acaggtttaa taaatttaac 5700 

ttctccgtat ttaccttctt ctacccataa 57 60 

agatgcaatg atacctttaa atctactttg 5820 

ttgaggatca aaagttgttg caaattggat 5880 

tgttttgaat gtttctccta ctataaaatc 5940 

ttcaaattaa ttttttagta tgtaacatgt 6000 

agactcaoct ctaactttct attgagacaa 6060 

gataccatca actccccaat tagcaagttg 6120 

tacgttcaat tcataacccg cttcttttac 6180 

atcttcagtg tttactattt tagcattaca 6240 

aaacgaagtt gtatggaata taactgctct 6300 

tttaacaagc acaacattaa agcttaaatg 6360 

aattgttctt ccacttgctt aaccatactt 6420 

atacctttta attctacatt taaattcata 6480 

tcgaaagttg gcaaatgttc atctttgaat 6540 

tctttaattt catcataatt caattcagtt 6600 

aaataatcat catgaatgat aatcagttgt 6660 

aaccagttta taccttctac ttctgaagca 6720 

gctttactag gtaatcctct atgtccatat 6780 

tattttttta attaacgtaa ctgtattatc 6840 

agagatgaat atcataaata aagaagtcga 6900 

cgtctcatct catttttaaa aaatcattta 6960 

ctagtttttc tgtacccctt atctgcaatt 7 02 0 

atcagaattt tactgagaac tgcttttaac 7080 

actgctttta acgtacctct tatctgcaat 7140 

tatcagcaat tttgcatgga attgctttta 7200 

actgctttta acaaacctct tatctgcaat 7260 

tatcctgtaa ttttactgag aactgctttt 7320 

aactgctttt aacaaacctc ttatctgcaa 7380 

ttattagtat aatctcagta agaatgcgta 7440 
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taaaaatgaa aattacaacc gattttgtaa 
agagactaat ggctcgagcc atacccctag 
aaaaaataag catatcactg taaactttaa 
tcgagttagg atttatgtcc caagctcatc 
atcttatatg ttctgatatg gtcttctata 
tgtgtcaatt cacgttcgaa atcaaattct 
gtattcttcg actgataatt gctctctaga 
taatgctttg tcatatcctt taacgattga 
accataacta aatagacata cttctcttct 
attaaactta agtataatga tcctgtataa 
gttctgttgc aaagttgaat ttatagtata 
ctatttcaga tataaacaat ttaacaagga 
aagatataca ttgagttatc gtgatatatc 
tcatcattca acggtctacc gttgggttca 
gaagaaaaag cataaaaaag cttattacaa 
aaaaggaaaa tggagctatt tatatcgtgc 
ttggttgcgt aagcaacgag ataatcattc 
acaatttggt aaacctcaaa aggtaattac 
ggctaaagta attaaagctt ttaaacttaa 
gaataacctc attgagcaag atcaccgtca 
tatcaataca gcaaagaata ctttaaaagg 
gaaccgcagg tctcttcaga tctacggatt 
agcaagttaa gcgaacactg acatgataaa 
aacagaaccg aaaataatct cttcaattta 
taatatctaa agatttcagt tcatcataga 
attgattaaa taaattctct aatttctccc 
cattgatata gtcaatgaat aatgtacaaa 



gtctggacgc c"tgagggaat agtatgtgcg 7500 
gcaagcatgc acgtacaaaa tcgtaagata 7560 
aaaatcagtt tagtgatatg cttatttatt 7 62 0 
aagcacaatc g<gccactagt ttatttctct 7 68 0 
ctgtataagt a-tacttttga atatggatct 774 0 
tgattatcaa a"tgctgttaa agaatgtttc 7800 
ttctagcata t"ttaagtgtt tctctttatc 7860 
accactaaag a-tttctccta ctgctcctga 7920 
ggttggaatg t«gtggttctg taataacgaa 7980 
atgttaccaa catctctatt ccataatacg 8040 
attttaacaa aaaggagtct tctgtatgaa 8100 
tgttatcact gtagccgttg gctactatct 8160 
tgaaatatta agggaacgtg gtgtaaacgt 8220 
agaatatgcc ccaattttgt atcaaatttg 8280 
atggcgtatt gatgagacgt acatcaaaat 8340 
cattgatgca gagggacata cattagatat 8400 
agcatatgcg t-fctatcaaac gtctcattaa 84 60 
agatcaggca ccttcaacga aggtagcaat 8520 
acctgactgt cattgtacat cgaaatatct 8580 
tattaaagta agaaagacaa ggtatcaaag 8640 
tattgaatgt atttacgctc tatataaaaa 8700 
ttcgccatgc cacgaaatta gcatcatgct 87 60 
ttagtggtta gctatatttt tttactttgc 8820 
tttttatatg aatcctgtga ctcaagattg 8880 
caatgttctt ttcaacattt tttatagcaa 8940 
gtttgatttc actaccatag attatattat 9000 
ttatcactca taacagt 9047 



<210> 740 
<211> 1832 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Acinetobacter baumanii 

<400> 740 I 
cgcgtgtacg taatactggt gaagccccac 
aattacttgt tattgacggt ccattcacag 
acgataagtc acgtttaacg ttgacgatta 
tcgaatttcg ccaagtcgaa aaaacgattt 
tatcgggcat tgttgttgta acagtattgt 
tgtacccaga ggtatttaaa atggctaaga 
cagctggtaa agcaaatcca tctccaccga 
acatcatggc gttctgtaaa gaattccaaa 
taccagactt accaagtttt acgggcggct 
gctacatcga gccacgttta aagaatgtac 
tatggttgat gctctcaaag acagtcattg 
taattgtgca tgcggataca gagcagagta 
atcaattaga acagttgttg gcgactccag 
cgcattttga atcaattact ggtaaagcaa 
aatatattcg tgcaggcgat gtgatgcagg 
ttgatggaga agctttacag gtttaccgtg 
tattcttggt tcaaggacaa acgattactg 
caccggaaat tttatctcgt ttagaaaatg 
ctagaccgcg cggtaaaact aaagaagaag 
atgaaaaaga gattgctgaa catttaatgc 
gcgtatcaaa aataggtaag gtccaagtta 
atgtcatgca tattgtttca aatgtacaag 



gtcctaagac gatgtttgag ccaggtgaag 60 
actttaaagg tcjtggtggaa gaggttcaat 120 
acgtgtttaa tcgaccaact caggttgaac 180 
aatcttaatt gcjttgaaaag cgcccgattt 240 
tacgtagaaa ttggggagcc taacggcgtt 300 
agattgacgg ctatatcaag ctgcaagttc 360 
ttggtcctgc actaggtcaa cgtggtgtaa 420 
aacaatttaa acjttcctacg gctaaactct 4 80 
tggtgggtta tttgggctac gatgctgtcc 540 
ctgcggctga tccgattacg ctgccagatt 600 
tttttgacaa tcttaaagat acgctatttt 660 
atgcttatga agacgctcaa caaaaattag 720 
ttagtttgca acjcgcgacca catacgcctc 780 
aattcttaga gacggtagag aaggttaaag 840 
ttgtacctgg gcagcgtatg gtttctgatt 900 
cattacgtca tttaaatcca tcaccttatc 960 
ataaaaaacc atttcatatt gttggttcat 1020 
gtattgctac ag-ttcgacct ttggcaggaa 108 0 
atatagcatt ag-aaaaagat ttgctttctg 1140 
tgattgatct tg-ggcgaaac gatgtagggc 1200 
cggatcaaat gg-tgatcgag cgctattcac 1260 
gtgaagtgcg tg-atgatatc gatgcacttg 1320 
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atgtatttaa agccaccttt ccggcaggaa cgttatcagg tgccocaaaa attcgtgcaa 1380 

tggaaattat tgatgaagta gagcctgtga aaagaggagt ttttggcggg gctgttggtt 14 40 

atttgggatg gcatggtgaa atggatatgt cgattgcaat ccgtacttgt gttatccgtg 1500 

acaaaaaggt gtatgtacag gctggtgcag ggctagttgc tgactcaaat ccagaatctg 15 60 

agtggaatga aacccaaata aaagctcgcg cagtgatcaa agcggttgaa ttatcatcaa 1620 

acggattgat tttatgagtt tttagcggtt tttttaaaaa aaccgcttgc atcgtttgga 1680 

agttttgcta aactgcacac cgttccgata cgaaacgtac gaaacactaa gaaacgccgg 17 40 

catagctcag ttggtagagc aactgacttg taatcagtag gtccacagtt cgaatccgtg 1800 

tgccggcacc attttaaagt gttaagttaa tt 1832 

<210> 741 
<211> 382 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Acinetobacter baumanii 



tcaaaaatct ggtggtacst gtgccttcat 60 

tgcacgcaag cttggtgtsg atattgataa 12 0 

gcaagcactt gaaattgcfcg acatgcttgt 18 0 

ggactcggta gctgcactta cccctaaagc 24 0 

tatgggtcta caagcgcgtc ttatgagcca 300 

acgttcaaac tgtatggtta t'cttcattaa 360 
382 

<210> 742 
<211> 344 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Acinetobacter baumanii 



cacaatgaca 
tgatgctgag 
cctacttgtt 
ccgttcaggc 
agaaatcgaa 
ggcacttcgt 
ccagattcgt 



ttgcaagcaa 
cacgccctag 
tcacaacccg 
gcaattgatt 
ggtgagatgg 
aaaattacgg 
atgaaaattg 



ttgctcaatg 
accctcaata 
acaatggtga 
taatcgttg t 
gtgactctca 
gtaatgctaa 

gt 



<400> 742 

aaatctgccc gtgtcgttgg tgacgtaatc 
gtttatgaaa ccattgttcg tatggctcaa 
ggtcagggta acttcggttc gatcgatggc 
gtccgtatga ctaagctggc acatgagctt 
tgggaagata actacgacgg ttcggaacgt 
aacttgttaa tcaacggtgc tgcgggtatc 

<210> 743 
<211> 909 
<212> DNA 

<213> Artificial Sequence 



ggtaaatatc acccgcatg-g tgactcagct 60 

gactttagct tacgttattt attggttgat 120 

gatagcgccg cggcaatgcg ttataccgaa 180 

cttgcagatt tagaaaaag-a cacagttgac 240 

atccctgaag tacttccgac acgtgttcca 300 

gccgtaggta tggc 34 4 



<220> 

<223> Acinetobacter baumanii 



<400> 743 

gataatagct ataaagtttc aggtggctta 
ctttcaagta aattgcatct aacaatttac 
catcatggtg atccgcaata tccattgcgt 
actgtacgtt tttggccaag tgcagaaaca 
ctagcacgcc gtttacgtga gctttctttc 
gatgaacgta ttaaccttga gcatgtgtat 
aaatacatca acgaaggtaa aaaccatctc 
gacaacggta ttgctgtaga agttgcattg 



cacggcgtag gtgtttctg-t tgttaacgca 60 

cgtgctggtc aaatccatga gcaagaatat 120 

gtgattggtg aaacggata a taccggaaca 180 

ttcagtcaaa ccattttta a tgttgaaatt 240 

ttgaatgctg gtgtacgtat cgttttacgt 300 

gactatgaag gcggtttat c tgagtttgta 3 60 

aacgaaatct tccatttcac agctgatgct 420 

caatggaacg atagttacca agaaaatgtt 4 80 
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cgctgtttca caaacaacat tccacaaaaa gatggtggta cgcacttagc aggtttccgc 540 

gcagctttaa cacgtggctt aaaccagtat ctggaaaatg aaaatattct caagaaagaa 600 

aaagtgaatg tgactggtga tgatgcgcgt gaaggtttaa cagcgatfcat ttctgttaag 660 

gttcctgatc caaaattctc gtctcagaca aaagaaaaat tggtatcaag tgaggtaaaa 720 

ccagcggtag agcaagcaat gaacaaagag ttctctgctt acttactfcga gaatccacaa 780 

gctgcaaaat caattgcagg caagattatt gatgctgcac gcgcacgtga tgctgcacgt 84 0 

aaagcacgtg aaatgacacg ccgtaagagt gcattagata ttgcaggfctt gcctggtaaa 900 

ttggctgat 909 



<210> 744 
<211> 1430 
<212> DNA 

<213> Artificial Seguence 
<220> 

<223> Acinetobacter baumanii 



<400> 744 

aacttgctct ttcgtgagtt cagtaaatga cttttcttgt atatggtaga gtttaggtat 60 
cgaaactctg ccattttttt tatctatgtt atattttccg cagttttfcga attttgacta 120 
tttgaggcaa accgcttggc gaccccattt tggtacaaca ccgcacttca tttattaaaa 180 
ccgttttatc gttggcggat aaaaagacgt gcagaaagtc tagaattata tcagcaggaa 24 0 
tgtctggaaa gatttgggcc atttgaagca ccgaagaatg taaaagccjat ctggtttcat 300 
gctgtttcag tcggggaaac caatgctgca cagcccttaa ttgaatatta cctaaaactt 360 
gggcagccag tcttagtgac caataccacg aaaacaggtc aggctcgtgc caagtcactt 420 
tttctaaaag aaccatattt agatttattt caagccgttt atttgcctgt agaccagaag 480 
cctcttttaa aaaaattttt tgagttatat cagccaaagc ttttagcact ggttgaaact 540 
gaactctggc caaatttaat cgatcaagcc aaattacagc atgtaccttg tttgctgctt 600 
aatgctcggt tgtcagaaaa atctgcaaaa ggatatggca aagtctcg-gg tttaaccgca 660 
ggtatgttaa aacagctgga ctgggtgtta gctcaagata gtgcaactcg tcagcgttat 720 
gttgagcttg gtttagacga acacaaaagt caggtcgttg gtaatattaa gtttgatatt 780 
catgcgccag aggcttttat taaacaagct gcccaattgc atcagcaatg gtatctggaa 840 
aatcggcagg ttgtgacgat tgccagtaca catgcacccg aagaacaaca aattttggaa 900 
gcactcgcac cttatttaaa ttcagatcgt gagttggtgt gtattgtg-gt gcctcgtcat 960 
cctgagcgtt tcgatgaagt atttgaaatt tgccaaaatt taaatttaat tacgcatcgt 1020 
agaagtatgg gccaaagtat tcatgccagc acgcaagttt atctcgct ga cagtatgggt 1080 
gagctctggt tatggtatgc cttaagtcag gtgtgttttg taggcggttc tttaaatgag 1140 
ccgggtgggg ggcataatat tttagaacct atggttttaa atgtacct ac tgtagttgga 1200 
ccgcgttatt ttaactttca aacgattgtc gatgagttca ttgatgaaaa tgctgtgctt 1260 
attgctcaag atgcgcagca ggtcgttgat atctggttag catgtctg-gc agaacttgag 1320 
gcgactgaac agttagtgat acaggcgcat aaagtgttgc aacgtaat ca aggttcctta 1380 
caaaaacata tcggggtgat taatcgctat ctggccgaaa aatcatgaat 1430 

<210> 745 
<211> 3609 
<212> DNA 

<213> Artificial Seguence 
<220> 

<223> Concatenation of C. jejuni genes 
<220> 

<221> misc_feature 

<222> 478-527, 1005-1054, 1457-1506, 2014-2063, 2562-2611, 

3071-3120 
<223> n = A,T,C or G 

<400> 745 

atgataggtg aagatataca aagagtatta gaagctagaa aattgatt"tt agagatcaat 60 

ttgggtggaa ctgctattgg aacaggaatt aattctcatc ctgattat cc gaaggttgta 120 

gaaagaaaaa taagagaagt gacaggtttt gaatatactg tggctgagga tttaatcgag 180 
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gcgactcaag atacgggagc ttatgtacaa atttcaggtg ttttaaaacg tgt"tgcaaca 240 
aaactttcta aagtatgtaa tgacttaaga cttttaagta gtggtccaaa atg-tggtctt 300 
aatgagatta atcttccaaa aatgcaacca ggtagttcta tcatgccagg taaggtaaat 360 
cctgttattc ctgaagtagt taatcaagtt tgttattttg ttattggagc aga cgtaact 420 
gtaacttttg cttgtgaggg tggacaatta caacttaatg tttttgaacc agt-tgtannn 480 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnngat ccti:ttacgg 540 
ctgatcctac tatcatagta ttttgtgatg tgtatgatat ttacaaagga caa atgtatg 600 
aaaaatgtcc aagaagcata gcaaaaaaag caatagaaca ccttaaaaat agt ggcatag 660 
ctgatactgc ttactttgga ccagaaaatg aattctttgt ttttgatagt gta aaaatag 720 
ttgatactac tcattgttct aagtatgaag ttgataccga agaaggagag tggaatgatg 780 
atagagaatt taccgatagc tacaatactg gacacaggcc aagaaacaaa ggt<ggatatt 840 
ttccagttca gccaattgat tctttagtag atattcgttc tgaaatggtt caaacccttg 900 
aaaaagtagg tcttaaaact tttgttcatc atcatgaagt tgcacaagga caa«gctgaaa 960 
taggagtaaa ttttggcacg cttgtagaag cagctgacaa tgttnnnnnn nnnnnnnnnn 1020 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnngaaatg aaaaaacgtt ctt-ttatcca 1080 
tgaaggtatg caccgtcttt ttgattcttt ccctgataat gctcatccta tggcagtttt 1140 
acaaggtgct gtttcatcgc ttagtgcttt ttatcctgat catttaaaca tgaatgtaaa 1200 
agaagaatat atggaaatgg cagctagaat agtagctaaa atccctacta tag-tggccac 12 60 
cgcttataga tataaacacg gctttcctat ggcttatcca aatttagatc gtg«gttttac 1320 
agaaaatttc ttatatatgt taagaaccta tccttacgat catgtagagc ttaaacctat 1380 
agaagtaaaa gcacttgata cagtttttat gcttcatgca gatcatgagc aaaatgcttc 1440 
aacttcaaca gttcgtnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1500 
nnnnnnggcg gacatttaac tcatggtgca aaagtaagtt cttcgggcaa aat<gtatgaa 1560 
agctgttttt acggcgtaga acttgatgga agaattgatt atgaaaaagt aagsgaaatt 1620 
gctaaaaaag aaaaaccaaa acttatagtt tgtggagcta gtgcttatgc aagsgtgatt 1680 
gattttgcta aatttagaga aattgctgat gaaataggtg cctatctttt tgc-bgatata 17 40 
gcacatattg caggtcttgt tgtggcaggt gagcatccaa gtccttttcc atacgctcat 1800 
gtagtaagct caactacaca taaaactttg cgtggcccaa gaggtggtat tat-tatgaca 18 60 
aatgatgaag agcttgctaa aaaaattaat tctgccattt ttccaggtat tcaaggtggt 1920 
cctttgatgc atgtaattgc tgcaaaagca gtaggattta aatttaatct tagcgatgag 1980 
tggaaagttt atgcaaaaca agtaagaact aatnnnnnnn nnnnnnnnnn nnnnnnnnnn 2040 
nnnnnnnnnn nnnnnnnnnn nnnggactta atatcaatga aaattgtgga gct-fctacatc 2100 
ctgcaaattt agctgctgag gtaaagcgtt tgcgtgcaga tgtgggcttt gct"bttgatg 2160 
gtgatgcaga tcgtttggtg gttgtagatg aaaaaggcga agtggctaat ggg<gatagtt 2220 
tattgggcgt attggcactt tatcttaaag aacaaggtaa attacaatca agtgttgtgg 2280 
ctactataat gagtaatggt gctttaaaag agtttttaaa taaacatggt atagaacttg 2340 
atacttgtaa tgttggcgat aaatatgtgc ttgaaaaact aaaagccaat ggtggaaatt 2400 
ttggtggaga acaaagtggg catattattt ttagcgatta tgcaaaaact ggagatggtt 24 60 
tgatagccgc cttgcaattt agtgctttaa tgctttctaa gaaaaaatct gcasgttcta 2520 
ttttgggtca agttaaacct tatcctcagc ttttaaccaa tnnnnnnnnn nnnnnnnnnn 2580 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nttacattta agcggctatg acttaagctt 2640 
agaagatctt aaaaatttcc gccaacttca ttctaaaacc cctggacacc ctgaaatttc 27 00 
aactcttgga gtagaaatcg ctacaggccc tttaggacaa ggcgttgcca atgctgtagg 27 60 
ctttgctatg gcagcaaaaa aagcacaaaa tttgctaggc agtgatttaa tcgsitcataa 2820 
aatttattgt ctttgcggag atggggattt acaagaaggc atttcttatg aagcttgttc 2880 
tttagcagga cttcacaaac ttgataactt catacttatt tatgatagca acastatctc 2940 
tatagaaggc gatgtaggtt tagcctttaa cgaaaatgta aaaatgcgtt ttgaagcaca 3000 
aggatttgaa gttttaagta taaatggaca cgattatgaa gaaatcaata aagccttaga 30 60 
acaagctaaa nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3120 
gccggtgctt cagatcctgc ggctttacag tacttagctc catatactgg tgtsaccatg 3180 
ggtgaatttt ttagagacaa cgcaaaacat gctttaattg tttatgatga tttgagcaag 3240 
catgctgtag cttatcgcga aatgtctttg attttacgtc gtcctccagg tcgfcgaagct 3300 
tatccaggtg atgtttttta ccttcattca agattgcttg aaagagcaag caag-ctaaat 3360 
gatgaattag gtgctggttc tttgacggca ttgccgataa ttgaaacaca agcaggagat 3420 
gtttctgctt atattccaac taatgttatt tcaattacag atggacaaat tttottagaa 3480 
actgatttat ttaactcagg aattcgtcct gcaattaatg ttggtttatc agtatctcgt 3540 
gtaggtgggg ctgctcaaat taaagctaca aaacaagttt caggcacatt aagacttgac 3600 
cttgctcaa 3609 

<210> 746 
<211> 1214 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bordetella pertussis 



<400> 746 

ttcgggatgg gcatgggcgt cgcacacgac gtcgaacacc gtgccggttt catccagtgc 60 

gggttcctgc tcgaccgtgg ccacgcgcag gcccgaactg cgggcgatgt cgccgtcgtc 120 

gggctgcgtg cggccatcga gcaggcgcag cagcgaagat ttgcccgcgc cgttgcgccc 18 0 

gatcaggccg atgcgctcgc cttcctggat ggcgaaatcg gcgtggtcca gcaaggggtg 24 0 

gtggccgtag gccagttgaa cgtcggtaag cgtaataagg gaagcgaggg ccatggacag 300 

acagagtgat gcgtattggc ctgtattgtc gcaggaacgc gcggggcggg cagcgcgcgg 360 

ccgggagccg gggcgccgac tgtactaaaa tgccgctcgc agggcagatc gggcaatcgc 420 

gggggatgca aatccttcga ggaaggtccg gactccacag ggcgggatag cggctaacgg 480 

ccgtccggcg acgctggcgg gcttgcccgc cggaaaagcc gaggaacagg gocacagaga 540 

cgagtctgtc atgagggcgc gcctggcgcg caccggcacg gccatctccg tgccgcgccg 600 

tccggaaacg ggcggcggca tgacagggtg aaacgcggca acctctatcc ggagcaacat 660 

caaataggca tgcgtacggc cgtaaggccg ggaagggcgg ctccgtccaa gcatgcgggt 72 0 

aggtggctgg agcggtccag caatggttcg ccaagaggaa tgattgcccg ccggggaaac 780 

ccggcgtaca gaatccggcc tatagatctg ctctgcactg cattttcatg acagccggcc 840 

ggaatccggc cggctggctt acggcgccct acgtaaaaca atgaaagcac gcgcccaggc 900 

cctacttctg atgaatgaca tttaatagtc ggcgcaagct gctgatttat cagcaagttt 960 

tccgcgcgcc gtttttacaa ttggctctaa gtccttgttg tgattgaaaa aattgcaacc 1020 

acggccttga cccatgaagt tgcttcccgt agagtgggaa aaagtaggaa tttgtgcaac 1080 

taagtggggc aaacgggtgt tccagggaag cagcgcactc acgttggatg ctaaggggcg 1140 

gatctcgatt ccgacccggc atcgtgacgc gctcatggac cgtgccgaag gccggttgac 1200 
cctgacccgt catc 1214 

<210> 747 
<211> 925 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Burkholderia mallei 



<400> 747 

gaccgatccg gcaagccgcg gcttcgcgcc 
gccggcgaac gcgtcgccgg ccttctacga 
acgcgcaccg gatgtcctcg cccgtgccgc 
gcgcggcgcc gcgcaccgcc ggtgcgcgct 
tcgcgccgag cggatgccgc tgtacggtgc 
gtcgcggctt tcgcggttcg ccgcgaaggg 
gtgatggcta acggccatcc gtggcgacac 
gatggcccgg cgcaagccgg gatcaggcaa 
ggctgcggcg acgcagaccg gcacggtaac 
cgcgcatctt cggatgcagg acggtgcccc 
cgtcagcaat ggcgcgccta gaggaatggc 
gttcgcacag aatccggctt atcggcccgc 
cgatgcggcg ccggcctttt ttccgtttcg 
aagcgcgtta cgcggcgacg atttcgaacg 
tgatcgacgc cgaacagtat ttgtcgtgcg 
ggttcaggtt gcggcccgtc accgt 

<210> 748 
<211> 713 
<212> DNA 

<213> Artificial Sequence 



gctgccggtg ctgggcgtgc ccggctggtg 60 

cgatccgcag gtgtttcgcc goggccgccg 120 

gtgacggacg acgcttgcct cg-gcgcgccc 180 

ttcgcgcaag ccgccgccgc ggcaagggcg 240 

tagaatcccg ctcgcaaagc acggctaggca 300 

cgaggaaagt ccggactcca acagggcagg 3 60 

gcggaacagg gcaacagaaa gcaaaccgcc 420 

gggtgaaacg gtgcggtaag acjcgcaccgc 480 

ctccacccgg agcaattcca acjtaggcgga 54 0 

cgtctcgttc gcgggtagga agcttgagcg 60 0 

tgccacgggc cgcgcgcctt ccjggcgtgcg 660 

tttgccgccc gatgacgaaa gcjccggcgcc 720 

cggtccgcgc gcacggccgc ttgacggacg 780 

aatgcgacag ctcggccgtc ttggcgatca 840 

acagattgat cgcgcgctcg acggtcgcgg 900 
925 



<220> 
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<223> Bacillus subtilis 
<400> 748 

gttcttaacg ttcgggtaat cgctgcagat cttgaatctg tagaggaaa<g tccatgctcg 50 

cacggtgctg agatgcccgt agtgttcgtg cctagcgaag tcataagcta gggcagtctt 120 

tagaggctga cggcaggaaa aaagcctacg tcttcggata tggctgagt a tccttgaaag 180 

tgccacagtg acgaagtctc actagaaatg gtgagagtgg aacgcggtaa acccctcgag 240 

cgagaaaccc aaattttggt aggggaacct tcttaacgga attcaacgga gagaaggaca 300 

gaatgctttc tgtagataga tgattgccgc ctgagtacga ggtgatgagc cgtttgcagt 3 60 

acgatggaac aaaacatggc ttacagaacg ttagaccact tacatttaaa atgatgaaaa 4 20 

caagctctcc cgtataagga gagcttttat cttgaaaaga gaaaagttt"t aaaagacagg 4 80 

gtgatacgat gaagaagtat acactaattg caacggcgcc gatgggcati: gaagctgttg 540 

tcgcaaagga agtacgagat ttaggatacg aatgcaaggt tgataacgg<g aaagttattt 600 

ttgaaggtga tgcacttgcc atctgccgtg cgaacctttg gcttagaaca gccgaccgca 660 

taaaggttca ggttgcttct tttaaagcga aaacatttga tgaactgtt"t gaa 713 

<210> 749 
<211> 828 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Clostridium perfringens 
<400> 749 

aaaacaagtt ctttttcata taatgatgta taagtaatac ttagggtgga gaattatgtg 60 
gtccaatttt ggaaatagat ttaatggaga ctgctttaaa gacttaagga gagatttcaa 120 
tagattaatg agaaatttta agagaaatgc ttgtaaaaga tgtcttatta ataattgcta 180 
tttcagaaat gccttaaagt ggggggctgt aggtggcata ttaaccttcc ttataataag 24 0 
tcaaatagga gttcctttag caattgtttg tattggaata gtagcaatat ttgtgatttg 300 
taataaatgg tagaaaaaat aatttgaaaa aaataagtat atatgttaat attaatcttg 360 
cgagtaagcc agacaatcgc tgctagtctt agaactagga gaggaaagtc cgagctccat 420 
agggcaggat gctggataac gtccagtgga ggtgactcta aggatagtgc aacagaaata 480 
aaccgcctag atttatctag gtaagggtgg aaaggtgagg taagagctca. ccagggtata 54 0 
ggtgactata ctgctatgta aaccccatct ggagcaagac caaataggag gacatatagg 600 

ggctgcccgt cccgtcctcg ggtgtgtcgc ttgagcctat cggcaacggt aggcctagat 660 

agatgattgt caaatacaga actcggctta tagacttatc tcgtatttat aaaacactag 720 

gttataatac ctaagtgttt ttttatttta caaaaaaata tactgatagt ttctttcctt 780 

attaacttta atcttacagt attaatttaa ttttattgga tatatcta 82 8 

<210> 750 
<211> 777 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Escherichia coli 
<400> 750 

gaagctgacc agacagtcgc cgcttcgtcg tcgtcctctt cgggggagao gggcggaggg 60 
gaggaaagtc cgggctccat agggcagggt gccaggtaac gcctgggggg gaaacccacg 120 
accagtgcaa cagagagcaa accgccgatg gcccgcgcaa gcgggatcag- gtaagggtga 180 
aagggtgcgg taagagcgca ccgcgcggct ggtaacagtc cgtggcacgg- taaactccac 240 
ccggagcaag gccaaatagg ggttcataag gtacggcccg tactgaaccc gggtaggctg 300 
cttgagccag tgagcgattg ctggcctaga tgaatgactg tccacgacag aacccggctt 360 
atcggtcagt ttcacctgat ttacgtaaaa acccgcttcg gcgggttttt gcttttggag 420 
gggcagaaag atgaatgact gtccacgacg ctatacccaa aagaaagcgg cttatcggtc 480 
agtttcacct ggtttacgta aaaacccgct tcggcgggtt tttgcttttgj gaggggcaga 540 
aagatgaatg actgtccacg acactatacc caaaagaaag cggcttatcg gtcagtttca 600 
cctgttttac gtaaaaaccc gcttcggcgg gtttttactt ttggaggggc agaaagatga 660 
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atgactgtcc acgacactat acccaaaaga aagcggctta tcggtcagtt ttacctgatg 720 
tacgtaataa accgttccgg cgggtttcag attgttgagt gcgctttatt catgccg 777 

<210> 751 
<211> 834 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Rickettsia prowazekii 
<400> 751 

taataaatta atttattcat atcaaagttt gctaatagtg tatatgtttt taagtactac 60 

attatttata cattaggaaa aaaatagtag caatgttaaa ttaagatatc tttctttaaa 120 

gaattgacat tatggaattt gttattataa ttgtaattat attgtgtgta caataattac 180 

aataaatttt cccctcagaa cctaacaagc taattgaaat tcttttaaca tattattgac 240 

taatttaaga aaagctacca taatctaaat ggtcgtgcag ttgcgtgatg ataatcacga 300 

ggaaagtccg gactctatag aggtatggtg ccggttaaca tccggcagag tattattact 360 

ttagggctag taccacagaa aatataccgc cgagtatttc ggtaagggtg aaasaggtgtg 42 0 

gtaagagcac accggtaagt tggcaacaag ttacgcatgg ttaaccccac caagagcaag 480 

atcaaatagg cattacagaa tttaaatatt tatttaagtt cctgggttac ctctaatcgg 540 

attgtaatgc gggtagatcg cttgaggtaa acggtaacgt ttatcctaga taaataactg 600 

caatgaatta atattcatac agaatccggc ttatagacca gatgagcagg tatfcacatgt 660 

gttaataccg cagagttatt gcgagtaact gaaaaaagta tggcaatcta gaaaaataat 720 

cagattctgt ggatttttag tgttccctcg caatgacgaa aataatacac gtatagatta 780 

cagcatgaga tgatactaat acttactgta acataattta atgaaaaagt tata 834 

<210> 752 
<211> 783 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Staphylococcus aureus 
<400> 752 

tgatattttg ggtaatcgct atattatata gaggaaagtc catgctcaca cagtctgaga 60 
tgattgtagt gttcgtgctt gatgaaacaa taaatcaagg cattaatttg acggcaatga 120 
aatatcctaa gtctttcgat atggatagag taatttgaaa gtgccacagt gacgjtagctt 180 
ttatagaaat ataaaaggtg gaacgcggta aacccctcga gtgagcaatc caaatttggt 240 
aggagcactt gtttaacgga attcaacgta taaacgagac acacttcgcg aaatgaagtg 300 
gtgtagacag atggttatca cctgagtacc agtgtgacta gtgcacgtga tgagjtacgat 360 
ggaacagaac atggcttata gaaatatcac tactagttta gctctcctag atgatggaga 4 20 
gcttttttca tgaaaagaac acttaaaatt aacaccttgt cttgatataa tgacactgcc 4 80 
ttgttttaaa atagtaagcg gatgcgttaa tgtatcagcg attaaatttg ttggaaatgt 540 
ataaaaaaca caagctaaga ataaaatacc tgtataaaag gagaatcata tatgtttcaa 600 
ttacttgcag tttgtccgat gggattagaa gctgttgttg ctagggaaat tcaeagaatta 660 
ggctatgaaa caaatgttga aaatggtcgt atattttttg aaggagacgc aagtgcaatt 720 
gtaaaggcaa atttatggtt gcgcacagca gaccgaatca aaattgttgt tggacgtttt 7 80 
aac 783 

<210> 753 
<211> 1086 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Vibrio cholerae 
<400> 753 
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gtgaaatcgg tctggcgatc gagatgggct gtgatgcaga agacatcgca ctgaccattc 60 

atgctcaccc aacgctgcat gagtctgtgg gtctggcggc tgaagtattc gaaggcacca 120 

tcactgacct gccaaacgcc aaagcgaaaa agaaaaagta atttctgatt cactggtttg 180 

gttaaaaagt gtttgattga agaaaccgct gcttgcagcg gttttttttc gcttttgatt 240 

cccaattgaa tgcaaagtaa gcagttaaac cgagatttaa cgcgctggcg gattgtttct 300 

ggactctgct cacacttctc gctacaatcc gccgcggagt tgactgggfca gtcgctgcct 3 60 

tattgacgtc ccttggccta tcgccaggga gactgataag ggggaggaaa gtccgggctc 420 

catagagcag ggtgccaggt aacgcctggg gggcgcaagc ctacgacaag tgcaacagag 480 

agcaaaccgc cgatggcctc tccttcggga tgggatcagg taagggtgaa agggtgcggt 540 

aagagcgcac cgtgcgactg gcaacagttc gtagcagggt aaactccacc cggagcaaga 600 

ccaaataggc ctccacatag cgttgctcgc gttaggaggc gggtaggtfcg cttgagccag 660 

tgagtgattg ctggcctaga ggaatggcta ctaccgcgca agcggaacag aacccggctt 720 

atacgtcgac tccacctatt gcagacccat catagccttg tgttatggfcg ggttttttgc 780 

tttttgctga ctcaggtaac agcaaaacca ttaaacttag cgcaaagtca cgtcctgtat 840 

acgcttatgg ctagcggtag tgtctgaaat cggtacacta acccatatcga cgggcatgcc 900 

cagcacaacg agaaaatcat gactgcttct ttccaacaca tctctgtafct attgaacgaa 960 

tcgattgaag gattggcgat caagccggat ggtatctaca tcgatggcatc ctttggccgc 1020 

ggtggtcaca gtcgcaccat tctcgctcag ttaggcccag aaggtcgtct gtacagcatt 1080 
gaccgt 1086 



<210> 754 
<211> 369 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Coxiella burnetii 



<400> 754 

gagttaaccg gagtatccat cgtgacttac 
aaaatcaccg ttaataaagc cgttttagaa 
gaaggagatg ggattggcat tgatatcgcg 
gtggaaaaat cctacgctgg aaagcgaaaa 
aaggctacga aagtgtatgg caaagacaat 
aaagaatacc aagtggccat taaaggtccc 
tcgctcaat 

<210> 755 
<211> 1317 
<212> DNA 

<213> Artificial Sequence 



caacacatca aagttcccag ccaaggtgaa 60 

gttcctgacc gacccattat tccctttatc 120 

cccgtcatga aaaacgtggt cgatgccgcc 180 

attgaatgga tggagatcta cgccggagaa 240 

tggctgcctg atgagacact cgaagccatt 300 

ttaaccacgc cggtgggggg tggcatacgt 3 60 
369 



<220> 

<223> Acinetobacter baumannii 



<400> 755 

attacgacgg aaattgtaag taattttgtc 
cttggaaagt ctatcaagtg tttgtatgat 
gggtaaaaaa atatctcagg ggccaataaa 
ctctggagga tatccatgaa attgagtcgt 
ccattagctg ctgctaatgc tggcgtaaca 
caagacagcc aacacaacaa tggcggtaaa 
caagacgatt tattcgttgg cgcagctctt 
gaagctgaat ataaccaagt taaaggcgac 
aaacaaaaac aaatcaacgg taacttctat 
gacagcaaaa tcaagccgta cgtattatta 
ggcgtaaacc gtggtacacg tggtacttct 
ggtgctttct ggcgcttaaa cgacgcttta 
aatgctgatg aagagttctg gaactataca 
ggtcacttga agcctgctgc tcctgtagta 
gctccacaac cacaagagtt aactgaagac 



aataatttac ttgattaaaa ttatcaagca 60 

tcaaatgtga atagcttaaa aataatactg 120 

tttaggctga gcttgaacaa caattgttat 18 0 

attgcacttg ctactatgct tgttgctgct 24 0 

gttactccat tattgcttgg ttacactttc 300 

gatggtaact taactaaccjg tcctgagtta 360 

ggtatcgagt taactccatg gttaggtttc 420 

gtagacggcg cttctgctcjg tgctgaatat 480 

gttacttctg atttaattac taaaaactac 540 

ggtgctggtc actataaaha cgactttgat 600 

gaagaaggta ctttaggtaa cgctggtgtt 660 

tctcttcgta ctgaagctog tgctacttat 720 

gctcttgctg gcttaaacgjt agttcttggt 780 

gaagttgctc cagttgaacc aactccagtt 840 

cttaacatgg aacttcgtcjt gttctttgat 900 
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actaacaaat caaacatcaa agaccaatac 
ttatctgaat accctaacgc tactgcacgt 
cgtaagttga acgaacgttt atctttagct 
aacgaataca acgttgatgc ttctcgtttg 
attgctgaca acaaaactaa agaaggtcgt 
actggtagcc gtactgtagt agttcaacct 
caataatttg agttcttgaa cagtaaaaaa 

<210> 756 

<211> 4932 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Rickettsia prowazekii 



aagccagaaa ttgctaaagt tgctgaaaaa 960 

atcgaaggtc acacagataa cac-fcggtcca 1020 

cgtgctaact ctgttaaatc agcfecttgta 1080 

tctactcaag gtttcgcttg ggatcaaccg 1140 

gctatgaacc gtcgtgtatt cgcgacaatc 1200 

ggtcaagaag cggcagctcc tgcagcagct 12 60 

gcgactcgtt agagtcgctt ttttatg 1317 



<400> 756 

atggctcaaa aaccaaattt tctaaaaaaa 
acggctacta tagtagctgg tttctctggt 
aggacaacaa atgcagcagc tacaaccttt 
gctaatattc ctgtcgctcc aaattcagtt 
tttaatactc caaacggtca tttaaatagt 
gtaacaatta atgaggatac taccttagga 
ttctttaatt ttactgttgc tgctggtaaa 
gttcaagaag cttctaatac aataaatgct 
gctgctatta acgctaatga tcttagcggg 
tctgtattag aatttaattt aataaatcct 
gctaattcta aaatagttaa tggtggtaat 
caggtttcag ataacacttt tgctggtatt 
ttaatgttta attctactcc tgatgccgct 
actattaatt ttaatggaat agacggtact 
gctgctaccg aatttaatgt tacaggaact 
ttgaacactg cagcagtagc tggtaaactt 
ataggtacag ataatggagc aggtagagct 
aatgcagcaa caatttctgg acaagtttat 
gcaggtggac aagtcacttt tgaacacata 
tttaaaactg cagattctaa agttataata 
tttggtaatc ttgacacaca gattgtagtc 
ataggtgatg taaaaaataa cggtaatact 
gctttagtaa gtgctagtac tgatccaaat 
gcagaagggg ccggggttgt agaattatca 
aatggtggct ctatctttaa acttgctgat 
aatgctctta tgaataataa tgctcttgca 
ataattaccg gtgatatagg taacggtggt 
gctaacgatg cttcaaaaat attagcactc 
ggtggtgcaa ttcattttca agctaacggt 
aatattgtag ttaattttga tttagatata 
agtagtttaa caaataatca aactttaact 
aatactaaaa cacttgcaca attaaacatc 
gatgtcgcta ttaacgagtt agttatagaa 
acttacttaa taacaaaaac tatcaatgct 

gatcctctta atactaatac tactcttgct 
ccactttcta ctattcattt tgccactaaa 
ggtaaaggag taaatttata tgctaataat 
ttacacttta ggtctggtgg tacaagtata 
cataagctta ataatttaat attagataat 
acatttaatg gtggtactaa aattgaaggt 
actactgatc atgttgaatc tgctgataat 
cctataaccg taacattaaa taaacaaggt 
atttctggtc caggtaacat agtatttaat 



ataatttccg caggattggt aactgcttcc 60 

gtagcaatgg gtgctgctat gcaatataat 120 

gatggtatag gctttgatca agctgctggt 180 

attactgcta atgctaataa tcctattact 240 

ttatttttgg atactgcaaa tgatttagca 300 

tttataacaa atattgctca gcacggctaag 360 

attcttaaca taacagggca gggtattact 420 

caaaatgctc ttacaaaagt gcatggtggc 4 80 

ctaggatcaa taacctttgc tgctgcgcct 540 

acaactcaag aagctcctct tacacttggt 600 

gggacattaa atattactaa tggatttatt 660 

aagaccatta atatcgatga ttgtcaaggt 720 

aatactttaa atttacaagt aggtggtaat 7 80 

ggtaaattag tattagtcag taagaatggt 840 

ttaggtggta atctaaaagg tattattgaa 900 

atctctcaag gaggtgctgc taatgcagta 960 

gcaggattta ttgttagtgt tgataatggt 1020 

gctaaaaaca tggtgataca aagtgctaat 1080 

gttgatgttg gtttaggcgg taccaccaac 1140 

acagaaaact caaactttgg ttctactaat 1200 

cctgatacta agattcttaa aggtaacttc 1260 

gcaggtgtga ttacttttaa tgctaatggt 1320 

attgcagtaa caaatattaa tgcaattgaa 1380 

ggaatacata ttgcagaatt acgtttaggg 1440 

ggcacagtaa ttaatggtcc agttaaccaa 1500 

gctggttcta ttcagttaga tgggagtgct 1560 

gttaatgctg cgttacaaca cattacttta 1620 

gatggcgcaa atattatcgg ggcfcaatgtt 1680 

ggtactatta aattaacaaa tactcaaaat 1740 

actactgata aaacaggtgt tgttgatgca 1800 

attaatggta gtatcggtac tgttgtagct 1860 

gggtcaagta aaacaatatt aaatgctggc 1920 

aataatggtt cagtacaact taatcacaat 1980 

gcaaaccaag gtcaaataat cgttgccgct 2040 

gatggtacaa atttaggtag tgcagaaaat 2100 

gctgctaatg ctgactctat attaaatgta 2160 

attactacta acgatgctaa tgtaggttct 2220 

gtaagtggta cagttggtgg acagcaaggt 2280 

ggtactactg ttaagttttt aggtgataca 2340 

aaatccatct tgcaaattag caataattat 2400 

actggtacat tagaatttgt taacactgat 2 4 60 

gcttattttg gtgttttaaa acaagtaatt 2520 

gagataggta atgtaggaat tgtacatggt 2 58 0 
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atagcagcta attcaatttc ttttgaaaat gcaagtttag gtacatcttt attcttacct 2640 

agtggtactc cattagatgt tttaacaatt aaaagtaccg taggtaatgg tacagtagat 2700 

aattttaatg ctcctattgt agttgtatca ggtattgata gtatgatcaa taa^cggtcaa 27 60 

atcatcggtg ataaaaagaa tattatagct ctatcgcttg gaagtgataa cag-tattact 2820 

gttaatgcta atacattata ttcaggtatc agaactacaa aaaataatca agg-tactgtg 2880 

acacttagtg gtggtatgcc taataatcct ggtacaattt atggtttagg ttt agagaat 2940 

ggtagtccaa agttaaaaca agtgacattt actacagatt ataacaactt agg-tagtatt 3000 

attgcaaata atgtaacaat taatgattat gtaactctta ctacaggagg tat agcaggg 3060 

acagattttg acgctaaaat tactcttgga agtgttaacg gtaacgctaa cgt aaggttt 3120 

gttgatagta cattttctga tcctagaagt atgattgttg ctactcaagc taataagggt 3180 

actgtaactt atttaggtaa tgcattagtt agtaatatcg gtagtttaga tactcctgta 3240 

gcttctgtta gatttacagg taatgatagt ggggcaggat tacaaggcaa tatttattca 3300 

caaaatatag attttggtac ttataattta actattctaa attctaatgt cattttaggt 3360 

ggtggtacta ctgctattaa tggtgaaatc gatcttctga caaataattt aat atttgca 3420 

aatggtactt caacatgggg tgataatact tctattagta caacgttaaa tgt atcaagc 3480 

ggtaatatag gtcaagtagt cattgccgaa gatgctcaag ttaacgcaac aactacagga 3540 

actacaacca ttaaaataca agataatgct aatgcaaatt tcagtggcac aca agcttat 3600 

actttaattc aaggtggtgc tagatttaat ggtactttag gagctcctaa ctttgctgta 3660 

acaggaagta atattttcgt aaaatatgaa ctaatacgtg attctaacca ggattatgta 3720 

ttaacacgta ctaacgatgt attaaacgta gttacaacag ctgttggaaa tagtgcaatt 3780 

gcaaatgcac ctggtgtaag tcagaacatt tctagatgct tagaatcaac aaatacagca 3840 

gcttataata atatgctttt agctaaagat ccttctgatg ttgcaacatt tgt aggagct 3900 

attgctacag atacaagtgc ggctgtaact acagtaaact taaatgatac aca aaaaact 3960 

caagatctac ttagtaatag gctaggtaca cttagatatc taagtaatgc tga aacttct 4020 

gatgttgctg gatctgcaac aggtgcagtg tcttcaggtg atgaagcgga agt atcttat 4080 

ggtgtatggg ctaaaccttt ctataacatt gcagaacaag acaaaaaagg tgg-tatagct 4140 

ggttataaag caaaaactac tggggttgta gttggtttag atactctcgc tagcgataac 4200 

ctaatgattg gggcagctat tgggatcact aaaactgata taaaacacca aga"ttataag 4260 

aaaggtgata aaactgatat taatggttta tcattctctc tatatggttc cca acagctt 4320 

gttaagaatt tctttgctca aggtaatgca atctttacct taaacaaagt caa aagtaaa 4380 

agtcagcgtt acttcttcga gtctaatggt aagatgagca agcaaattgc tgc"tggtaat 4440 

tacgataaca tgacatttgg tggtaattta atatttggtt atgattataa tgcaatgcca 4500 

aatgtattag taactccaat ggcaggactt agctacttaa aatcttctaa tga-aaattat 4560 

aaagaaaccg gtacaacagt tgcaaataag cgcattaata gcaaatttag tga-tagagtc 4 620 

gatttaatag taggggctaa agtagctggt agtactgtga atataactga tat"tgtgata 4 680 

tatccggaaa ttcattcttt tgtggtgcac aaagtaaatg gtaaattatc taactctcag 4740 

tctatgttag atggacaaac tgctccattt atcagtcaac ctgatagaac tgctaaaacg 4800 

tcttataata taggcttaag tgcaaacata aaatctgatg ctaagatgga gtatggtatc 4860 

ggttatgatt ttaattctgc aagtaaatat actgcacatc aaggtacttt aaaagtacgt 4920 

gtaaacttct aa 4932 

<210> 757 
<211> 1311 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Rickettsia prowazekii 



<400> 757 

atgactaatg gcaataataa taacttagaa 
tttaagttac ctatacttaa agcaagtatc 
tctgcggaag ccgattactt tacttatgat 
tctactatca catatataga cggtgataaa 
aaagacttag ctgagaaaag tgatttttta 
ctaccaagta gtgatcagta ttgtaatttt 
aatgaaagat tacactattt atttcaaacc 
atgcttgcag ctgttggttc tctttcagca 
acagactatg aacttaccgc tattagaatg 



tttgcagaat taaaaattag agg-fcaaacta 60 
ggtaaagatg taatcgatat aag-tagggta 120 
ccgggtttta tgtctactgc ttc-ttgtcaa 180 
ggcatattat ggtatcgagg ata-tgatatt 240 
gaagtggcat atttgatgat ttatggggag 300 
actaaaaagg ttgctcatca ttcattagtg 360 
ttttgtagtt cttctcatcc tat<ggctatt 420 
ttctatcctg atttattaaa ttt-fcaatgaa 48 0 
attgctaaga tacctactat cgctgcaatg 54 0 
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tcttataaat attctatagg gcaaccgttt 

gaaaattttc tacatatgat gtttgcaact 
ataaaaaatg ctcttaataa gatatttatc 
acttcaacag ttcggattgc tggctcatca 
ggtattgcat cactttgggg gcctgctcac 
cttaaagaaa ttggcagttc tgagaatatt 
aatgatccat ttaggttaat gggttttggt 
gccgcagtac ttaaagaaac ttgtaaagaa 
aatccgctgt tacaaatagc aatagaactt 
attgaaagaa aattatatcc aaatgttgat 
ggtataccgt cgcaaatgtt cactgtactt 
gcacaatgga aagaaatgca cgaagatcct 
tacactggtt atgtacatag agagtataag 

<210> 758 
<211> 882 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Vibrio cholerae 



atttatcctg ataattcatt a<gattttacc 600 

ccttgtacta aatataaagt aaatccaata 660 

ttacatgcag accatgagca gaatgcttct 720 

ggagctaatc cttttgcatg tattagcact 780 

ggcggggcta atgaagcagt gataaatatg 840 

cctaaatatg tagctaaagc taaagataag 900 

catcgagtat ataaaagcta tgacccgcgt 960 

gtattaaatg aattaggtca g-ttagacaat 1020 

gaagctctcg ctcttaaaga t«gaatatttt 1080 

ttttattcag gcattatcta taaagctatg 1140 

tttgcaatag caagaaccgt a<ggttggatg 1200 

gaacaaaaaa tcagtagacc tagacagctt 1260 

tgtattgtag aaagaaagtg a 1311 



<400> 758 

atgttcggat taggacacaa ctcaaaagag 
attcttgctg aaaaatttac ctttgatccc 
agtgaagaga tcattcgatt aggcagcaac 
cgtccaaacg aggtgatttc tcgcaatgat 
tttgaagtcg atgattccag cttaacccaa 
gattcgacaa agtccccaca atacgtcaaa 
gcccgagtgg aaacggttga agaagagatg 
tctcagccag aatctgtcaa tgaatacgca 
gtagtgaaca caccgcagcc agccaatgtt 
aatcgactgc ttattctgat agcggtctta 
ccgagccaaa ccagctttaa acccctaacg 
aataaccacc ctgatctttc aaactggcta 
aatgaaaagc atactggtgg gctcaagccg 
aaccagttaa cgctgaatta cattcacagc 
cgcatcgttg ctaaccctaa cgatgccatc 

<210> 759 
<211> 1095 
<212> DNA 

<213> Artificial Sequence 



atatcgatga gtcatattgg tactaaattc 60 

ctaagcaata ctctgattga caaagaagat 120 

gaaagccgaa ttctttggct gctggcccaa 180 

ttgcatgact ttgtttggcg agagcaaggt 240 

gccatttcga ctctgcgcaa aatgctcaaa 300 

acggttccga aacgcggtta ccaattgatc 360 

gctcgcgaaa gcgaagctgc tcatgacatc 420 

gagtcaagca gtgtgccttc a-fccagccact 480 

gtgacgaata aatcggctcc aaacttgggg 540 

cttcccctcg cagtattact gctcactaac 600 

gttgtcgatg gcgtagccgt caatatgccg 660 

ccgtcaatcg aactgtgcgt taaaaaatac 720 

atagaagtca ttgccacagg tggacaaaat 780 

cctgaagttt caggggaaaa cataacctta 840 
aaagtgtgtg ag 882 



<220> 

<223> Francisella tularensis 



<400> 759 

atgcttaaag ttggttttat tggttggcgc ggaatggtcg gctcagtttt aatgtctcgt 60 

atgatcgaat caaaagattt tgattgtatt ttgccaacgt ttttttcgac atctcaggta 120 

gggcagctgc caacaggttt tatgcaacaa tatggagcgt tacaagatgc ctatagtatc 180 

gaccaactaa gtagtatgga tatacttcta agttgccaag gtggtgaata taccaaagaa 240 

atacaccaca aattaagaga agccggctgg caaggtttct ggatagacgc tgcatcgaca 300 

ctacgcttag acaaagatag tactctagtt ctagaccctc taaatcacga tcaaataatt 3 60 

aatgctattg ataatggtaa aaaagatttt atcggtagta attgtactgt tagtctaatg 420 

tcactagcta tagctggact actcaaagaa gatcttgttg aatgggttaa ctctagtact 480 

tatcaagcaa tttcaggagc gggtgccgca gcaatgcaag aactacttca acaaacaagc 540 

cttttaagca aaattgataa tagagatgaa gatattctaa ttagagaaaa aattctcaga 600 

gaattatcaa aagactcatc aaaaatccct caacaaaaaa ctgtacaaac tttggcttat 660 

aatctattac cttggataga tgttggtatg cctagtggac aaacaaaaga agagtacaaa 720 
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gcagctacag aacttaataa aattctagat actaaaaaaa caatccctgt 

tgtgtcagag taccaagtct aagatcacac tctcaagcat taacagtcaa 

aaattaacaa ttgaagaaat taagcaaaaa atatctcaag gtaatgaatg 

atagataata acaaagaaga tactttaaaa tacctaacac ctcaagctaa 

cttgatattg ctataggtcg tatcaaatca tcgttattag ctgatgatat 

ttcacagtag gtgatcagct attatgggga gctgctgagc cccttagaag 

attattaaaa tataa 



cgatggtata 780 
acttagacag 840 
ggttaaagta 900 
ttcaggaact 960 
atttcattgt 1020 
agttttaaat 1080 
1095 



<210> 760 
<211> 1020 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Francisella tularensis 



«J00> 760 

atgaataaaa 

gaacttcttg 

tctgtaatag 

cttttaggta 

cattttgctg 

aacaatatcc 

tttgtcttta 

gatatgcctc 

ttgcgagatt 

ccagtcggcg 

ctcatgcctt 

ggtgactatg 

gcaataggtc 

tataatcttg 

gccctaggta 

agttttgctg 

gatgatattt 



aaatcttagt 
atagagatta 
acagggttaa 
aagctaagct 
gctttaaagc 
aaggtacact 
gttcatcggc 
taagtacaac 
tgcaaaatgc 
cccatagtag 
atgtcgcgca 
agactataga 
atatattagc 
gttctggaaa 
aagagattcc 
atgttgccaa 
gtgatgatat 



aacaggtggt 
tcaagttgtg 
aaaaatcaca 
aacaaaagtt 
tgtaggtgag 
aaacttactt 
gactgtatat 
taacccatac 
taataataat 
tgggatgata 
agtaggtgct 
tggtacagga 
gttagaaaaa 
tggctattct 
atatcagata 
ggctaaaaga 
gcttaaatgg 



gtaggctata 
gtggtagata 
aataaagatt 
tttcaagagt 
agtgttgaaa 
gagctaatgc 
gggatgaata 
ggtgcaacta 
tttaatatta 
ggagaggatc 
ggtaaactag 
gtgagagact 
ttatcacaag 
gtattagaga 
gtagctagga 
gagttgggtt 
caaaagtacg 



taggtagtca 
atctttcaaa 
ttgattttta 
atgatattta 
aaccgttaga 
aagagtataa 
ataaaccacc 
agctaatgtt 
catgtcttag 
cacagggtat 
ctaaacttag 
atatacatgt 
ataagcctag 
ttgtcaaagc 
gagccggtga 
ttgagacaca 
caaaagagaa 



tacagtggta 
tagcaaagta 

tgctgtaatt 
gtattatcat 
agtttataat 
ctttacagaa 
agaagacatt 
atattttaat 
acctaataac 
tatctttggt 
tgtagattta 
ctggagagct 
ttatcaaaaa 
tattgcagcg 
aaagsctata 
taatatctag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



<210> 761 

<211> 840 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Shigella flexneri 



<400> 761 

cgccccctgg 

cagatatggc 

cgcctttccg 

tggctggaaa 

gatgccactg 

ctcctggtcc 

ctgggcaggg 

actctccatt 

aaacttcagc 

gcaaatgacc 

gactggttct 

tgggcgcagg 



ctgatgccgt 
atgcttttga 
ataccgtctc 
aactcagtgc 
agagctgtga 
atcaggcatc 
aaatgttccg 
ttgtggatga 
tctccactgc 
tccgcactgc 
ccctctgggg 
cagaagagca 



gacagcatgg 
acatgaagag 
tgcacgcaat 
ctctgcggag 
ggaccgtgtc 
agaaggcctt 
cctcgaaatt 
gatagaagtc 
cgtgaaggaa 
cgaagccatg 
accatggcat 
gaaatatgag 



ttcccggaaa 
cacgccaaca 
acctccggat 
cttcgacagc 
gcgctcacat 
ttcgataatg 
ctggaggaca 
tacctggcct 
atgcgtttct 
gtcagaagcc 
gctgtactga 
atgctggaga 



acaaacaatc 
ccttttccgc 
tccgtgaaca 
agtctttcgc 
ggaacaatct 
ataccggcgc 
ttgcccggga 
tccagaccat 
atggcgtgtc 
gtgaagagaa 
agcgtacgga 
atgagtaccc 



tgatgtatca 60 
gttccttgac 120 
ggtcgctgca 180 
tgttgctgct 240 
ccggaaaacc 300 
tctgctctcc 360 
taaagtcaga 420 
gctcgcagag 480 
gggagtgaca 54 0 
tgaatttacg 600 
agctgaccgc 660 
tcagagggtg 720 



gctgaccggc tgaaagcatc aggtctgagc ggtgatgcgg atgcggagag ggaagccggt 
gcacaggtga tgcgtgagac tgaacagcag atttaccgtc agctgactga cgaggtactg 



780 
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<210> 762 
<211> 503 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Campylobacter jejuni 



<400> 762 

aaaaacttaa aaaaagctgt tttttacttg atattgtttt ttaaatatgc taaaattagg 60 

cgtttcaatt aaaacaaagg agcttttatg actaaagcag atttcatttc attagttgct 120 

caaacagctg ggctaacaaa aaaagacgct actactgcta ctcjatgcagt tatttctact 180 

attactgatg ttttagctaa aggtgatagc atcagtttta ttcjgttttgg tactttttca 240 

actcaagaaa gagctgctag agaagctaga gtaccaagca cacjgaaaaac aatcaaagtt 300 

cctgctacaa gagttgcaaa atttaaggta ggtaaaaacc ttaaagaagc tgttgcaaaa 360 

gcaagcggca aaaagaaaaa ataaaacttc ggctagataa atfcctagcct tctttcttaa 420 

ttaattcaga tattttgtat cttttattta ctttattttt ttaaactttt ataaaactat 480 
cattattaaa caaaaaagga tat 503 



<210> 763 
<211> 2118 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Concatenation of A. baumannii genes 



<220> 

<221> misc_feature 

<222> 447-486, 778-817, 1162-1201, 1495-1534, L928-1967, 2115-2118 
<223> n = A,T,C or G 



<400> 763 

cgcgcggtaa aactaaagaa gaagatatag 
aagagattgc tgaacattta atgctgattg 
cgaaaatagg taaagtccaa gtcacggatc 
tgcatattgt ttcaaatgta caaggtgaag 
ttaaagccac ctttccagca ggaacgttat 
ttattgatga agtagaacct gtgaaaaggg 
gatggcatgg tgaaatggat atgtcgattg 
aggtgtatgt acaggctggt gcagggnnnn 
nnnnnnggaa tctggcggtt tagtttcaga 
tattgctcaa cctgactgcg tgaatggttg 
tcaagcagaa gctttggaaa aagaagggat 
acctgatgaa gaaatcgtaa aacgtctttc 
tgtttatcac gttgtataca atccacctaa 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
aaccttaaaa ctggtaaagt tttagaaaaa 
gctgacatcg tagaagtaga aatgaactac 
atggacccag aaagcttcga acaaattgca 
aaatggttaa aagacgactc aaatgaaaca 
ttaaacgtaa atgcacctaa cttcgttgta 
cgtggtgata cttctggtgg tnnnnnnnnn 
ntcgtgcccg yaatttgcat aaagctgccg 
aaactctaga agaatggatt gcactacccg 
tgtctttagg tttacgtcag tatggcgtga 
cccgtttctt tgccattgaa gatgacttaa 
aactggctga agagctttgt cccacccaac 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
caagattgcc aatgatattc gttggttagc 



cattagaaaa agatttgctg tctgatgaaa 60 
atcttgggcg aaacgatgta gggcgtgtat 120 
aaatggtgat cgagcgttat tcacatgtca 180 
tgcgtgatga tatcgatgca cttgatgtat 24 0 
caggtgcccc aaaaattcgt gcaatggaaa 300 
gagtttttgg cgcjggctgtt ggttatttgg 360 
caatccgtac ttcjtgttatc cgtgataaaa 420 
nnnnnnnnnn nnannnnnnn nnnnnnnnnn 480 
tgaactcatt atcrggtttag taaaagaacg 540 
tattttcgac ggcttcccac gcactattcc 600 
cagcattgat cafcgtaattg aaattgatgt 660 
tggtcgtcgt cagcatcctg cttctggtcg 720 
agtggaaggt aaagatgatg tcacaggnnn 780 
nnnnnnncgt tcaaccgtgt aaaattacgt 84 0 
acttttaaat ctcjgtgatac tttagaagct 900 
ctatacaacg atcjgcgaaat gtggcacttc 960 
gctgacaaaa ctcjcaatggg tgatgctgct 1020 
tgtacaatca tgfctattcaa cggcgttcct 1080 
ttgaaagttg ttgjaaactga tccgggcgta 1140 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1200 
gccttgtagc acagcaaggc aaatttcctg 12 60 
gcattggtcg ctc:gaccgca ggtgcactca 1320 
ttatggatgg caacgtgaaa cgcgttttag 1380 
gcaaaccaca gcacgaacgt gaaatgtgga 1440 
gcaatcatga ctacactcaa gcgannnnnn 1500 
nnnnttaaaa acactagcgg taagcttaaa 1560 
aagtggtcca cgttgcggct tcggcgaaat 1620 
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ccgtattcct gaaaatgaac ctggttcaag tatcatgcca ggtaaag-tga acccgactca 1680 

aagtgaagcc atgaccatgg ttgttgctca agtacttggc aacgatacca ctattaatgt 1740 

cgctggtgct tctggtaact tcgagctcaa tgtatttatg ccagtga-ttg cttataactt 1800 

actgcaatct attcagttgc ttggtgatgc atgtaatagt tttaatgatc actgtgcagt 18 60 

agggatcgag ccaaatcgtg agaaaattga tcatttcttg cataattctc ttatgttagt 1920 

tacggcannn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnccc ggttatgtac 1980 

caaatacttt gtctgaagat ggtgacccat tagacgtact tgttgtaact ccacatcctg 2040 

ttgctgccgg ttctgtaatt cgttgccgcc cagtgggcaa attaaacatg gaagacgacg 2100 
gtggtatcga tgccnnnn 2118 

<210> 764 
<211> 276 
<212> DNA 

<213> Acinetobacter baumannii 
<400> 764 

atgagcgagc taggcttaaa aagcagtggc aagccaaaaa aatcagcgcg tacagtgggt 60 

gatgtacttg gtaaatacca cccacatggt gactcggcat gttatgaagc catggtactc 120 

atggctcagc catttagtta ccgctatcct ttaatcgaag gtcagggcjaa ctggggttca 18 0 

cctgatgatc ctaaatcttt tgctgcgatg cgttataccg aagccaaaot ctcggcttat 240 

agtgaattat tgctgagcga attaggtcag ggcact 27 6 

<210> 765 
<211> 9610 
<212> DNA 

<213> Yersinia pestis 
<400> 765 

tgtaacgaac ggtgcaatag tgatccacac ccaacgcctg aaatcagatc cagggggtaa 60 

tctgctctcc tgattcagga gagtttatgg tcacttttga gacagttatg gaaattaaaa 120 

tcctgcacaa gcagggaatg agtagccggg cgattgccag agaactgcjgg atctcccgca 18 0 

ataccgttaa acgttatttg caggcaaaat ctgagccgcc aaaatat acg ccgcgacctg 24 0 

ctgttgcttc actcctggat gaataccggg attatattcg tcaacgcatc gccgatgctc 300 

atccttacaa aatcccggca acggtaatcg ctcgcgagat cagagaccag ggatatcgtg 360 

gcggaatgac cattctcagg gcattcattc gttctctctc ggttcctcag gagcaggagc 420 

ctgccgttcg gttcgaaact gaacccggac gacagatgca ggttgac"tgg ggcactatgc 480 

gtaatggtcg ctcaccgctt cacgtgttcg ttgctgttct cggatacagc cgaatgctgt 54 0 

acatcgaatt cactgacaat atgcgttatg acacgctgga gacctgccat cgtaatgcgt 60 0 

tccgcttctt tggtggtgtg ccgcgcgaag tgttgtatga caatatgaaa actgtggttc 660 

tgcaacgtga cgcatatcag accggtcagc accggttcca tccttcgctg tggcagttcg 720 

gcaaggagat gggcttctct ccccgactgt gtcgcccctt cagggcacag actaaaggta 780 

aggtggaacg gatggtgcag tacacccgta acagttttta catcccacta atgactcgcc 84 0 

tgcgcccgat ggggatcact gtcgatgttg aaacagccaa ccgccac«ggt ctgcgctggc 900 

tgcacgatgt cgctaaccaa cgaaagcatg aaacaatcca ggcccgtccc tgcgatcgct 960 

ggctcgaaga gcagcagtcc atgctggcac tgcctccgga gaaaaaa<gag tatgacgtgc 1020 

atcttgatga aaatctggtg aacttcgaca aacaccccct gcatcat oca ctctccatct 1080 

acgactcatt ctgcagagga gtggcgtgat gatggaactg caacatoaac gactgatggc 1140 

gctcgccggg cagttgcaac tggaaagcct tataagcgca gcgcctgogc tgtcacaaca 1200 

ggcagtagac caggaatgga gttatatgga cttcctggag catctgc~ttc atgaagaaaa 12 60 

actggcacgt catcaacgta aacaggcgat gtatacccga atggcagcct tcccggcggt 1320 

gaaaacgttc gaagagtatg acttcacatt cgccaccgga gcaccgcaga agcaactcca 1380 

gtcgttacgc tcactcagct tcatagaacg taatgaaaat atcgtat-tac tggggccatc 1440 

aggtgtgggg aaaacccatc tggcaatagc gatgggctat gaagcag-tcc gtgcaggtat 1500 

caaagttcgc ttcacaacag cagcagatct gttacttcag ttatcta egg cacaacgtca 1560 

gggccgttat aaaacgaege ttcagcgtgg agtaatggcc ccccgcctgc tcatcattga 1620 

tgaaataggc tatctgeegt tcagtcagga agaagcaaag ctgttct-tcc aggtcatege 1680 

taaacgttac gaaaagagcg caatgatcct gacatccaat ctgccgt-tcg ggcagtggga 17 40 

teaaaegtte gccggtgatg cagcactgac etcagegatg ctggaccgta tcttacacca 1800 

ctcacatgtc gttcaaatca aaggagaaag ctatcgactc agacagaaac gaaaggcegg 18 60 

ggttatagca gaagctaatc ctgagtaaaa cggtggatca atattgggee gttggtggag 1920 
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atataagtgg atcacttttc atccgtcgtt 
tgaataacaa gaatgccgga gatacgcagt 
caaggtcgta ggtcgttata ggaaaattct 
ggccatgaac gactgacaac attacgaata 
ctactgatta aaccagccgg acttgtccac 
tgagtacaga tacatgtcac gatgatgcag 
ggaactgtgg tgaaccatag ctcaatattc 
ttcgttgtaa cgcgattttc cgtaccgggc 
cgtcagaaac gttctgagcg cgtttccggc 
cagttgacag ccggaaacac gcgggtgtcg 
agcacaaaaa acagatgttg tactcagtca 
gaaaaagtac cgtactttca ggaatgtcca 
ttccgggtga atttttttgt catccgttca 
aggatttttt .atgtggcgtt tactacaggc 
aacaggcgga acccggacag ggggagaacg 
cccatcgttg ctactgcaac gggatgaatt 
atttactcac actccacata tcactgacgg 
aaactgcgct gaatatggcg cgatttatca 
tggatgctct ggatgccgac gagcaggcgg 
aagaactcca gaacagcatc caggctcgct 
gaagctcccg gagacggtca cagcttgtct 
tcagggcgcg tcagcgggtt ttagcgggtg 
cgatagcgga gtgtatactg gcttagtcat 
caccatgtac ggtgtgaaat gccgcacaga 
tcttccgctt cctcgctcac tgactcgctg 
tctgctcact caaaagcggt gatactgtta 
aacatgtgag caaaaaacga agaccccaga 
ggctccgccc ccctgacgag catcacaaaa 
cgacaggact taaagatacc aggcgtttcc 
tccgaccctg ccgcttaccg gatacctctc 
ttctcatagc tcacgctgtt ggtatctcag 
ctgtgtgcac gaaccccccg ttcagcccga 
tgagtccaac ccggtaagac acgactttac 
agtggattta gatacgcaga actcttgaag 
acagcatttg gtatctgtgc tccacttaag 
acttctaacc ttcgaaagaa cccacgcctg 
agattacgcg cagaaaaaaa ggatctcaag 
gctcccgatc agttcagcag aagattatga 
cccatgttac ttgaggttgt atgtagtctg 
ctcttttttt tcaatatgca attggaagtt 
aatttattta tgtaagaaca ggatgggagg 
tctcatattt ttatcatcaa gtggccttgc 
cttgcaaaac ctagaattaa atacctttgg 
acagacaacc ttcaagcaaa ccgagtttac 
tgcacttatc aataaagaca actcatggat 
agatgagtat actgtctgtt ttgaagattt 
catacatcct ctacttataa aaaaagtaaa 
aaagaaatct atccctggtt gcactgtata 
tgaggggctc atggttattt taacaatcca 
tctcgtttac gtctgtcaga cattccatca 
ttccagtctt ttgcgataac tttattccat 
cctttttgat atccaataga caccagtgct 
cgaagaccga tattagcgtc attgaaaaga 
taaatattag ataagagatg agcttcatta 
ttatcataag cctccttccc tcgaagcata 
tgagggacac cgtcattcaa taaatcctgt 
ccgaatgtaa caccggtact gttaaaataa 
gtcggattaa acccttcaaa accattactg 
gaacgacgta aaaattcctt cagttgacta 
gctaggacga tgcgatttcg gttattattc 
tgttcaatag attcggcaat cgttccctca 



gacaccctga tgaat-fccacg tgttcacgcc 1980 

catatttttt acacaattct ctaatcccga 2040 

tagcaccatt ccggaacaat cagaacagca 2100 

taaaaaacgc acccgcggcca gacattcccc 2160 

ggaacggtct ttttasaccg acacacagtc 2220 

gattagcgga agagtcgtgag cacgtttccg 2280 

gagtgagggc ataccejgaaa cgcgctcaga 2340 

aattttttca gttgtttttt cgtttcatgt 2400 

atctgatgct acgcaaacca tccccatggt 24 60 

ttttagcgta tcgaccjggac ggcgtcgaga 2520 

gttgttttac agacacgcact gcggcagatt 2580 

gaaaccatgt gtcagacttc gttctccccc 2640 

ggaatctctt tataacgatt actccatttc 2700 

aggatattca aaggcaaaaa aatcccccgg 27 60 

aatcgctaaa taatt-fctcgt agttgtattt 2820 

tgccgcagtt tatcc-fcgtaa aacaatcctg 2880 

agcacaacgg aatag"tgaac aaacaacaac 2940 

gaagccagag cctga-fcactg cttgaaaaac 3000 

ccatgtgtga acgactgcac gaactcgcgg 30 60 

ttgaagccga aagtgaaaca ggaacataac 3120 

gtgaacggat gccggcgagca gacaagcccg 3180 

tcggggcgca gccatgaccc agtcacgtag 3240 

gcggcatcag tgcggattgt atgaaaagtg 3300 

tgcgtaagga gaaca-fcgcag atgccgatgc 3360 

cgctcggtcg ttcggctgcg gcgagcggtg 3420 

tccacacaat caggggataa cgccggaaag 34 80 

aaaggccgcg ccggacggcgc tttttccata 3540 

atcgacgctc aagtcagagg tggcgaaacc 3600 

ccccggaagc tccctcgtgc gctctcctgt 3660 

cgcctttctc ccttcgggaa gcgtggcgct 3720 

ttcggtgtag gtcgttcgct ccaagctggg 3780 

ccactgcgcc ttatccggta actatcgtct 3840 

gccactggca gcagccattg gtaactgaaa 3900 

ttgaagcctt atcgcggcta cactgaaagg 3960 

ccagctacca caggt-fcagaa agcctgagaa 4020 

agaacgtggg ttttttcgtt tacaggcagc 4080 

aagatccttt gatcttttct actgaattgc 4140 

tggggttcta tgggtattgc tgcggtaaca 4200 

tgtagaatta tacacataag gcttaaactg 42 60 

cattgactac ataaatagat tattccaaat 4320 

gggaatgatc tcaaagttat tttgcttggc 4380 

agaaaaaaac acatatacag caaaagacat 4440 

caattcattg tctcatggca tctatgggaa 4500 

aaatattaaa agcaacacca aaaaacacat 4560 

gatatcatta aaaatactag gaattaagag 4 620 

ctctctaata agacccjccaa catatgtagc 4 680 

atctggaaac tttatagtag tgaaagaaat 47 40 

ttatcattaa tagcaagccc ctcattatta 4800 

ctatcgatat ctttttgcac cagagcgccc 4 8 60 

acaatattat taaaagcatt tacaaggcca 4 920 

actgtgggag cagttctgga taacttaaac 4 980 

gtacgggttc tcaacggtaa atcgctgaac 50 4 0 

ccttcaatct tatgtcgagaa tttatcaata 5100 

tcagaaagcg tcagacjgtgc tgttctcact 5160 

taatacccat caagtctatc tgcaatatac 5220 

ttgcttcgct gaccaaggtc aaccccggaa 52 80 

tcgctactag gattacgacgg aaaatgactt 5340 

gagaaaatat cgtggtcaac aatatttacc 5400 

atattgtcaa agttastgac agtgttgtcc 54 60 

agaatgtctt cgttc"fc;cttt cttatcgaga 5520 

agaaccatga cacggtagac tttcacaccg 5580 
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tctttttcct gacctgtttc aacagttatt 
tttgaaattt tactttcctg gcggatctta 
gtatcaatcg gaaaccccat aatgtacatc 
acataatgtg gtaatgcaat tgtaatcgaa 
aatgtacttc cggaaaagag aaaagccgga 
tctgacataa aaattcctct ttaacacata 
acataaaacc actgttttta tgtacagtaa 
ataatatgaa atttcatttt tgtgatctga 
ttcgcagcag aaaaacagca ccgggtaaca 
gcaaaacaaa atctggtcgc caccgactga 
ttgtcgttgc cggatacgaa gaacggacag 
cactgacaca ggccgacctg cgccatctgg 
cacaggccta cgatgcgatt taccaggcga 
gcggaaaaaa accggaaggg gtggaacccc 
aactcccgtt atcagtacca tcggctcaac 
aaagatcata tttccctgga tattttccac 
ctgtcagacg acgagaaaac ggatatcgat 
tgaaattaga taatcagata caaataatat 
tctaatgaag aaaagttcta ttgtggcaac 
tgcagcatca tctcagttaa taccaaatat 
caccgggatg ctgagtggaa agtctcatga 
gatcagccag ttagactgga agatcaaaaa 
ggatccatac tcatttctga ccctgaatgc 
aggtaatatg gatgactacg actggatgaa 
atctcatcct gctacaaatg ttaatcatgc 
gttactccag gatgagaatt ataaagcagg 
cagttggaca gctacaggtg gttcatatag 
cccgaaagga gtgcgggtaa taggttataa 
tgcaggccag tatcgcatta atgattttga 
ggttcgggca catgataatg atgagcacta 
atccggctca cgttattatg gtaccgtaat 
caaagtcttt gcggaattta catacagtaa 
cattgataag aatagtggag attctgtctc 
taaaaattat actgtgacgg cgggtctgca 
tctcttttca tcctccccta gcggggagga 
aaccttcaga tgtgtgaaaa atcacctttt 
gttttgcctt gacattctcc acgtctttca 
acgctactct ccttcctgta ggaagctcaa 
caacgtaacg ctttaacgtc gccagcttta 
ctgctggctg gcttataccc atcgcctcag 
cacgcatcat ctgcaagccg acctcaagaa 
tctggctttc aggtgaacga ctggcaatca 
cagtgtgttc agatgtgctg taaattcatc 
ccgcttatca ttacttttat ctcctgcaca 
cgcataaaag gctcttatcg gacggccaga 
gtaccgagaa cctttcacgg tatcggcata 
tagctttttc aaatcagcca aaaccttttc 
gtcgtcaaat cgctggctaa aaagtaccat 
accccactaa caatataacc tacgagttat 
acggcaattt gtacgcacca ctgaaatgcg 
aaatagcaac aaacctcccg aaaaaccgcc 
gcatatctat gcagccaggc gaatcacgaa 
gtgttttttc ctgatttggt gggcgtggaa 
tgccagatga gcgcgatctg gcaattaagg 
cgcgcccgga tatgttttaa cgcgattttc 
ccattcaccg ggctgatggt gaatacatgc 
ccacagctct gtctgttggc agctttcgcc 
aaggcctctg aaccagaaag caacaaaaca 
gtctgatgga ttgctgcaaa aaaaagtccc 
atagcgtcta aattaaagta agaatacgac 
ccgcacagaa taatcaacag taaaaacaaa 



ttctgttcgt aagacacggt cccttcagtt 5 64 0 

tttgaatatt cactgtcttt ctccatctcc 57 0 0 

agtttaaaat tactccggcc aggcagatcc 57 60 

ttagcttcaa aatttggtct gtaactgctt 5820 

acaccacctg aaccattcac taccattgta 5880 

aaaaaacaat aagttaaaaa aaaatactgt 5940 

taaaattacg ccgctttatt ttctctgtca 6000 

atcactctta taaaaatcag gaagggaaga 6060 

tcagaaaaaa acagaaagga gataacgtga 6120 

gcaaaacaga caaacgcctg ctggctgcac 6180 

cccgtgacct catccagaaa cacgtttaca 624 0 

tcagtgaaat cagtaacggt gtgggacagt 6300 

gacgcattcg tctcgcccgt aaatacctga 6360 

gggaagggca ggaacgggaa gatttaccat 6420 

gctcgttgtc ggatctgaaa aattcgctca 6480 

cgtttcttat g-fcgagcaaag tcacataatt 6540 

tattgtttaa tatttttaca ttattaaaaa 6600 

gttttcgttc a-fcgcagagag attaagggtg 6660 

cattataact a-fctctgtccg ggagtgctaa 6720 

atcccctgac acgctttacag ttgcagcctc 6780 

aatgctttat gacgcagaaa caggaagaaa 684 0 

tgtcgctatc c-fcgaaaggtg atatatcctg 6900 

cagggggtgg acgtctctgg cttccgggtc 6960 

tgaaaatcaa tctgagtgga cagatcactc 7 02 0 

caatgaatat gacctcaatg tgaaaggctg 7080 

tataacagca gcjatatcagg aaacacgttt 7140 

ttataataat gcgagcttata ccggaaactt 7200 

ccagcgcttt tctatgccat atattggact 7260 

gttaaatgca ttatttaaat tcagcgactg 7320 

tatgagagat c-ttactttcc gtgagaagac 7380 

taacgctgga tattatgtca cacctaatgc 744 0 

atatgatgag gcjcaaaggag gtactcagac 7500 

tattggcgga gatgctgccg gtatttccaa 7560 

atatcgcttc tgaaaaatac agatcatatc 7 620 

tgtctgtgga asggaggttg gtgtttgacc 7 680 

tcaccataat gacggggcgc tcattctgtt 7740 

gggcatggag aaggtcaaat tagacatgga 7800 

catccaagct taatttgcct cccattgctt 7860 

aatcatttcc gcgctgctcc agctttgtta 7 920 

caacttgttt t"tgtgataac tggagttctt 7 980 

tcatctcatc tgccatttct ttaattcgtg 8040 

cctcatctaa tgttctcatt acttgctctc 8100 

ctcagctata cgcaccagtt tttcataaaa 8160 

aagaacgata gcccgacgaa tcggatcgaa 8220 

aaactgaacg ogaagctctt tcatattttt 8280 

tggcctgggt aactcaggtc cgtaaacctg 8340 

ctgaagagcg tcttcttgct catttagcca 8400 

ccacatgctc aaccctataa cctgtagctt 8 4 60 

attttcaaga aaagctggct atttaacata 8520 

ttcagcgcga tcacggcaac agacaggcaa 8580 

gcgatcgcgc ctgataaatt ttaaccttat 8 64 0 

cgaattgcct gcctgatgta actgaaacgg 8700 

gacggaacat gaacgggaaa acagaattca 87 60 

caaaacacag caacaaagao acgccagaat 8820 

agactcagac aaattcagca gaatgctact 8880 

gtatccagga tcgagtacatt tctggctctg 8 94 0 

tgtccggaaa cctgcttaaa acgctcccga 9000 

caggccatta acjtaaatcgc gttaaaacac 9060 

taatggagca gggactgtta aacccagtga 9120 

caggtactct tcagaaaaga gattaatcca 9180 

caaccctgat t"ttttatttt tctttttttc 9240 
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gataaaaaca aaattaaaga aataattaat 

tgtgttccat tttgtgatta gtctgaaact 

tgctcacata aagcccctcc ttcaggcaga 

ggccctcaca ggaggtgttc tgtgagggcg 

gtctagtcct gacttttgca tctccgaata 

aaaaataaaa atgtgacatc gcaatgccag 
ccgacccctg 



cagaacattc ctt aacttca gggcattgcc 9300 
tccgaaggtg gat aacaccc ggtatttttt 93 60 
ggggcttttt ctt-tgccacc acataaaaaa 94 20 
tatgataagg actgaatcga tggttaatat 94 80 
taaaaccctg tttaacggca tgcaaaacca 9540 
ataatattga cgcatgaggg aatgcgtacc 9600 
~ ~ 9610 



<210> 766 
<211> 102 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Calibration Polynucleotide 
<400> 766 

tttaagtccc gcaacgagcg caacccttga tcttagttgt ttagttgggc actctaaggt 60 
gactgccggt gacaaaccgg aggaaggtgg ggatgacgtc aa 102 

<210> 767 
<211> 94 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Calibration Polynucleotide 
<400> 767 

tagaacaccg atggcgaagg cgactttctg gtctgtaact gacactgaga aagcgtgggg 60 
agcaaacagg attagatacc ctggtagtcc acga 94 

<210> 768 
<211> 108 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Calibration Polynucleotide 
<400> 768 

tggattagag accctggtag tccacgccgt aaacgatgag tgc"taagtgt tagaggcctt 60 
tagtgctgaa gttaacgcat taagcactcc gcctggggag tacggcca 108 

<210> 769 
<211> 108 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Calibration Polynucleotide 
<400> 769 

tttcgatgca acgcgaagaa ccttaccagg tcttgacatc ctc"tgacaac cctagcttct 60 
ccttcgggag cagagtgaca ggtggtgcat ggctgtcgtc agctcgta 108 

<210> 770 
<211> 95 
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<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Calibration Polynucleotide 
<400> 770 

tctgacacct gcccggtgct ggaaggttaa ggagaggggt tagcgtaact ctgaactgaa 60 
gccccagtaa acggcggccg taactataac ggtca 95 

<210> 771 
<211> 117 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Calibration Polynucleotide 
<400> 771 

tctgttctta gtacgagagg accgggatgg acgcaccggt accagttgtt ctgccaaggg 60 
catagctggg tagctatgtg cggaagggat aagtgctgaa agcatctaag cacgaaa 117 

<210> 772 
<211> 100 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Calibration Polynucleotide 
<400> 772 

tgattattgt tatcctgtta tgccatttga gatttttgag tggtattgga gttattgttc 60 
caggattaat tgcaaataca attcaaagac aagggfctaca 100 

<210> 773 
<211> 112 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Calibration Polynucleotide 
<400> 773 

tcgaagtaca atacaagaca aaagaaggta aaattactgt tttaggggaa aaattcaaga 60 
aatatagaag tgatggctaa aaatgtagaa ggggtcttga agccgttaac aa 112 

<210> 774 
<211> 100 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Calibration Polynucleotide 
<400> 774 

ttgctcgtgg tgcacaagta acggatatta caatcattgt tgttgcagct gatgacggcg 60 
taataaacag ttgaagcaat taaccatgcg aaagcagcaa 100 

<210> 775 
<211> 114 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Calibration Polynucleotide 
<400> 775 

tagcttttgc atattatatc gagccacagc atcgtgatg-fc tttacagctt tatgcaccgg 60 
aagcttttaa tggataaatt taacgaacaa gaaataaatc tatccttgga agaa 114 

<210> 776 
<211> 116 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Calibration Polynucleotide 
<400> 776 

tgacctacag taagaggttc tgtaatgaac cctaatgacc atccacacgg tggtggtgaa 60 
ggtagatctc ctatcggaaa gtccacgtac tccatggggt aaaccagcac ttggaa 116 

<210> 777 
<211> 70 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Calibration Polynucleotide 
<400> 777 

tccacacggt ggtggtgaag gtagatctcc tatcggaaag tccacgtact ccatggggta 60 
aaccagcaca 70 

<210> 778 
<211> 82 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Calibration Polynucleotide 
<400> 778 

ttatcgctca ggcgaactcc aacctggatg atgaaggccg ctttttagaa ggtgacttgt 60 
cgtagcaaag gcgaatccag ca 82 

<210> 779 
<211> 87 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Calibration Polynucleotide 
<400> 779 

tgggcagcgt ttcggcgaaa tggaagtggc tcgaagcgta tggcgcttcg tacgtgctgc 60 
aggaaatgtt gaccgtcaag tcggaca 87 

<210> 780 
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<211> 97 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Calibration Polynucleotide 
<400> 780 

tcaggagtcg ttcaactcga tctacatgat ggccgaccgc ccggggttcg gcggtgcaga 60 
ttcgtcagct ggccggcatg cgtggcctga tggcgta 97 

<210> 781 
<211> 117 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Calibration Polynucleotide 
<400> 781 

tctggcaggt atgcgtggtc tgatggccaa tccatctggt cgtatcatcg aacttccaat 60 
caagtttccg tgaaggttta acagtacttg agtacttcat ctcaacccac ggtgcga 117 

<210> 782 

<211> 98 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Calibration Polynucleotide 
<400> 782 

tcaagcaaac gcacaatcag aagctaagaa agcgcaagct tctggaaagc acaaatgcta 60 
gttatggtac agaatttgca actgaaacag acgtgcaa 98 

<210> 783 
<211> 99 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Calibration Polynucleotide 
<400> 783 

tccacacgcc gttcttcaac aactaccgtg ttctacttcc gtacgacgga cgtgacgggc 60 
tcgatcgagc tgccgaagga caaggaaatg gtgatgcca 99 

<210> 784 
<211> 111 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Calibration Sequence 
<400> 784 

tcgtggcggc gtggttatcg aacccatgct gaccgatcaa tggtacgtgc acaccgcccc 60 
ccaaagtcgc gattgaagcc gtagagaacg gcgacatcca gttcgtaccg a 111 
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<210> 785 
<211> 2100 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Combination Calibration Polynucleotide 



<400> 785 

gaagtagaga tatggaggaa caccagtggc gaaggcgact ttctggtctg taactgacac 60 
tgagaaagcg tggggagcaa acaggattag ataccctggt agtccacgcc gtaaacgatg 120 
agtgctaagt gttagaggcc tttagtgctg aagttaacgc attaagcact ccgcctgggg 180 
agtacggccg caaggctgaa actcaaagga attgaccgggg cacaagcggt ggagcatgtg 240 
gtttaattcg aagcaacgcg aagaacctta ccaggtcttg acatcctctg acaaccctag 300 
cttctccttc gggagcagag tgacaggtgg tgcatgcgttg tcgtcagctc gtgtcgtgag 360 
atgttgggtt aagtcccgca acgagcgcaa cccttgatct tagttgttta gttgggcact 420 
ctaaggtgac tgccggtgac aaaccggagg aaggtgcggga tgacgtcaaa tcatcatgcc 480 
ccagtaccgt gagggaaagg tgaaaagcac cccggaaggg gagtgaaaga gatcctgaaa 540 
ccgtgtgcca tagtcagagc ccgttaacgg gtgatgcgcgt gccttttgta gaatgaaccg 600 
gcgagttata agatccgtag tcaaaaggga aacagcccag accgccagct aaggtcccaa 660 
agtgtgtatt gaaaaggatg tggagttgct tagacaacta ggatgttggc ttagaagcag 720 
ccaccattta aagagtatag ggggtgacac ctgccccggtg ctggaaggtt aaggagaggg 780 
gttagcgtaa ctctgaactg aagccccagt aaacggoggc cgtaactata acggtcctaa 840 
ggtagcgaaa gaaatttgag aggagctgtc cttagtacga gaggaccggg atggacgcac 900 
cggtaccagt tgttctgcca agggcatagc tgggtacjcta tgtgcggaag ggataagtgc 960 
tgaaagcatc taagcatgaa gcccccctca agatgagagc agtaaaacaa gcaaacgcac 1020 
aatcagaagc taagaaagcg caagcttctg gaaagcacaa atgctagtta tggtacagaa 1080 
tttgcaactg aaacagacgt gcatgctgtg aaatttcjcga aagcttttgc atattatatc 1140 
gagccacagc atcgtgatgt tttacagctt tatgcaccgg aagcttttaa tggataaatt 1200 
taacgaacaa gaaataaatc tatccttgga agaacttaaa gatcaacgga tgctggcaag 1260 
atatgaaaaa taagataaaa cagcactatc aacactcjgag cgattcttta tctgaagaag 1320 
gaagagcgat gaaaacaacg aagtacaata caagacaaaa gaaggtaaaa ttactgtttt 1380 
aggggaaaaa ttcaagaaat atagaagtga tggctaeiaaa tgtagaaggg gtcttgaagc 1440 
cgttaacagc tgttatggcg accgtggcgg cgtggttatc gaacccatgc tgaccgatca 1500 
atggtacgtg cacaccgccc cccaaagtcg cgattgaagc cgtagagaac ggcgagatcc 1560 
agttcgtccc taaacagtac ggcaacttcg ttatcgotca ggcgaactcc aacctggatg 1620 
atgaaggccg ctttttagaa ggtgacttgt cgtagcaaag gcgaatcaag cctgtttagc 1680 
cacaactatg cgtgctcgtg gtgcacaagt aacggatatt acaatcattg ttgttgcagc 1740 
tgatgacggc gtaataaaca gttgaagcga ttaaccatgc gaaagcagca ggagtaccaa '1800 
ctttactcag cttgctggta tgcgtggtct gatggccaat ccatctggtc gtatcatcga 1860 
acttccaatc aagtttccgt gaaggtttaa cagtacttga gtacttcatc tctacgcatg 1920 
gtgcgcgtaa aggtcatggg agtaagacct acagtaagag gttctgtaat gaaccctaat 1980 
gaccatccac acggtggtgg tgaaggtaga tctcctatcg gaaagtccac gtactccatg 2040 
gggtaaacca gcacttggat acaaaacaag cgcagttcgg cggccagcgc ttcggtgaaa 2100 
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